INDEX
Explanations
the word "Gh*hi" in reference to a specific person or historical figure
occurrences of the word "hi" in various contexts
New Auto-Interp
Negative Logits
argon
-0.75
ilater
-0.73
lain
-0.72
Cosponsors
-0.69
sylv
-0.68
Aven
-0.68
ividual
-0.67
icist
-0.65
swick
-0.64
nesday
-0.64
POSITIVE LOGITS
ya
1.01
emi
0.97
pper
0.92
yy
0.88
wa
0.85
oga
0.84
emen
0.84
yah
0.83
uli
0.83
ELD
0.83
Activations Density 0.009%