INDEX
Explanations
instances of the word "Ghomi"
repeated mentions of the word "hi."
New Auto-Interp
Negative Logits
lain
-0.80
nesday
-0.77
swick
-0.71
argon
-0.71
ividual
-0.68
orative
-0.68
tle
-0.68
ktop
-0.66
rations
-0.66
atility
-0.64
POSITIVE LOGITS
emi
1.02
ya
1.01
yy
0.92
kson
0.88
yah
0.87
pper
0.87
veland
0.86
ELD
0.84
0.84
pping
0.83
Activations Density 0.019%