Friday, January 3, 2020

Pleasant Surprise in December
























I was looking for ways of doing NER (Named Entity Recognition) in Portuguese and discovered that the spaCy guys have developed an open-source NER system for Portuguese  using our work on Universal Dependencies for Portuguese (paper in https://www.aclweb.org/anthology/W17-6523/)

This was a great surprise! I love it!!!

I worry as I don't think the corpus Bosque is good enough. It is small and somewhat old and full of mistakes, but still, the numbers for NER are very good:

NER ACCURACY
NER F 89.18
NER Precision 89.32
NER Recall 89.03

No comments:

Post a Comment