INDEX
Explanations
items or entities, typically in the context of a list or categorization
Tokens after abbreviations/initials
country codes
Negative Logits
not       -0.67
não       -0.61
doesn     -0.59
nicht     -0.58
не        -0.57
tidak     -0.53
de        -0.53
didn      -0.52
isn       -0.52
niet      -0.52
Positive Logits
ModelExpression   1.10
Personensuche     1.10
raiſ              1.07
Efq               1.06
Monfieur          1.05
NameInMap         1.03
itſelf            1.01
Theſe             1.00
OGND              0.99
nakalista         0.99
Activations Density: 0.072%