I was looking for ways to do NER (Named Entity Recognition) in Portuguese and discovered that the spaCy team has developed an open-source NER system for Portuguese using our work on Universal Dependencies for Portuguese (paper at https://www.aclweb.org/anthology/W17-6523/).
This was a great surprise! I love it!!!
I do worry, though, because I don't think the Bosque corpus is good enough: it is small, somewhat old, and full of mistakes. Still, the numbers for NER are very good:
| NER ACCURACY | |
|---|---|
| NER F | 89.18 |
| NER Precision | 89.32 |
| NER Recall | 89.03 |
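
As a quick sanity check on the table, here is a minimal sketch that recomputes the F-score from the precision and recall. It assumes that "NER F" is the usual balanced F1 (the harmonic mean of precision and recall), which the table itself does not state. The function name `f1_score` is my own, not part of spaCy.

```python
def f1_score(precision: float, recall: float) -> float:
    """Harmonic mean of precision and recall (assumed definition of 'NER F')."""
    if precision + recall == 0:
        return 0.0
    return 2 * precision * recall / (precision + recall)


# Values copied from the table above, in percent.
precision = 89.32
recall = 89.03
reported_f = 89.18

computed_f = f1_score(precision, recall)
print(f"computed F1 = {computed_f:.2f}, reported F = {reported_f:.2f}")
```

The recomputed value agrees with the reported one to within about 0.01, which is what you would expect if the precision and recall in the table were rounded before being published.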
