INDEX
Explanations
mentions of Nigeria and its derivatives
New Auto-Interp
Negative Logits
ought
-0.15
avax
-0.15
_sg
-0.15
etsk
-0.14
ipse
-0.14
uggage
-0.14
ÙģØ§Ø±
-0.14
ERG
-0.14
enticate
-0.13
Ñħови
-0.13
POSITIVE LOGITS
Delta
0.18
lum
0.16
delta
0.16
ati
0.15
wins
0.15
ischer
0.15
/problem
0.15
okoj
0.14
Twin
0.14
/problems
0.14
Activations Density 0.007%