Conversation
Note: The failures are caused by how the tests are set up, not by my code. Edit: Fixed now.
Fewer FPs for the mostly-img reason would be great, but I don't think excluding sites is the optimal approach. See #4190
MathJax would still get caught though |
I have updated the PR and it has nothing to do with SO now. Also, the issue is MathJax posted as images, not actual MathJax.
<img> code counted and the other 3 have a lot of MathJax used)
Statistics:
Excluding StackOverflow, Maths, MathOverflow and Cross Validated from the "post is mostly images" reason will result in:
The current accuracy of this reason is 17% (17)
New accuracy: 40% (40)
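For anyone wanting to reproduce this kind of before/after figure, here is a rough sketch of how the accuracy of a reason could be recomputed with some sites excluded. It assumes accuracy means true positives divided by all reports for that reason. The `Report` type, the `accuracy` helper, the site names and the sample data are all hypothetical and made up for illustration; they are not the project's actual code or data, and the output does not correspond to the 17% and 40% above.

```python
from typing import Iterable, NamedTuple, Optional


class Report(NamedTuple):
    """Hypothetical record of one report caught by a detection reason."""
    site: str     # site the post came from (illustrative names only)
    is_tp: bool   # True if the report was confirmed as a true positive


def accuracy(reports: Iterable[Report],
             excluded_sites: frozenset = frozenset()) -> Optional[float]:
    """Share of true positives among reports not on an excluded site.

    Returns None when no reports remain after exclusion.
    """
    kept = [r for r in reports if r.site not in excluded_sites]
    if not kept:
        return None
    return sum(r.is_tp for r in kept) / len(kept)


# Made-up sample data, NOT real statistics.
reports = [
    Report("stackoverflow.com", False),
    Report("stackoverflow.com", False),
    Report("math.stackexchange.com", False),
    Report("superuser.com", True),
    Report("askubuntu.com", True),
    Report("askubuntu.com", False),
]

before = accuracy(reports)
after = accuracy(
    reports,
    excluded_sites=frozenset({"stackoverflow.com", "math.stackexchange.com"}),
)
print(f"current accuracy: {before:.0%}, with exclusions: {after:.0%}")
```

Note that excluding a site also drops any true positives from that site, so the accuracy going up does not by itself show that nothing useful is lost.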
Excluding Cross Validated from the "mostly punctuation marks in {}" reason will result in: