INDEX
Explanations
instances of the word "abe."
New Auto-Interp
Negative Logits
phis
-1.07
neapolis
-0.80
ivities
-0.79
iosity
-0.76
ophical
-0.74
Beir
-0.73
ivity
-0.73
speak
-0.72
stream
-0.71
prus
-0.71
POSITIVE LOGITS
zz
1.01
legates
0.86
legate
0.82
ñ
0.81
ça
0.81
cki
0.80
cca
0.77
Ca
0.77
FORE
0.77
1981
0.76
Activations Density 0.010%