Named entity recognition
- Croatian 3-class model (hr.3-class.distsim.ser.gz): measured F1 of 0.899
- Croatian 4-class model (hr.4-class.distsim.ser.gz), trained on a small subset of the available data annotated with all four classes: measured F1 of 0.636
- Slovene 4-class model (sl.distsim.ser.gz): measured F1 of 0.7
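
For readers comparing these scores, F1 is conventionally the harmonic mean of precision and recall. The sketch below shows that calculation under the assumption of entity-level counts; the function name `f1_score` and the counts are hypothetical and are not taken from the evaluation of these models, whose exact protocol is described in the paper cited below.

```python
def f1_score(true_positives: int, false_positives: int, false_negatives: int) -> float:
    """Return the harmonic mean of precision and recall from raw counts."""
    predicted = true_positives + false_positives  # entities the model produced
    actual = true_positives + false_negatives     # entities in the gold annotation
    if predicted == 0 or actual == 0:
        return 0.0
    precision = true_positives / predicted
    recall = true_positives / actual
    if precision + recall == 0:
        return 0.0
    return 2 * precision * recall / (precision + recall)


# Hypothetical counts for illustration only.
example = f1_score(true_positives=70, false_positives=30, false_negatives=30)
print(round(example, 3))
```

Because F1 is a harmonic mean, it stays low when either precision or recall is low, so a single score does not say which of the two is the weaker one.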
Please cite this paper when using the models:
Ljubešić, N.; Stupar, M.; Jurić, T.; Agić, Ž. (2013). Combining Available Datasets for Building Named Entity Recognition Models of Croatian and Slovene. In Slovenščina 2.0: empirical, applied and interdisciplinary research, in press.