INDEX
Explanations
mentions of nonprofits and organizations
acronyms or abbreviations
New Auto-Interp
Negative Logits
familiar
-0.67
unsupported
-0.66
embell
-0.64
given
-0.63
Geral
-0.63
fib
-0.63
uncertain
-0.62
linen
-0.61
Bosnia
-0.61
ILCS
-0.61
POSITIVE LOGITS
td
0.89
yne
0.89
uty
0.84
agonist
0.84
IDs
0.84
tarians
0.83
ifter
0.83
undle
0.82
eeper
0.82
eed
0.81
Activations Density 0.094%