INDEX
Explanations
phrases indicating absence or negation
"nothing" followed by a qualifier
New Auto-Interp
Negative Logits
evt
-0.56
PDT
-0.54
odyear
-0.51
MST
-0.49
epo
-0.48
LDAP
-0.47
Wpf
-0.47
ffindor
-0.47
Patriot
-0.47
ujednoznacz
-0.46
POSITIVE LOGITS
nothing
1.05
Nothing
0.99
nothing
0.98
NOTHING
0.97
NOTHING
0.95
Nothing
0.94
nothin
0.87
Nada
0.73
THING
0.72
nada
0.69
Activations Density 0.010%