Explanations
the word "affer" with different variations and contexts
references to specific individuals or public figures
Negative Logits
supers    -0.66
Ho        -0.65
stone     -0.64
Ore       -0.62
Ind       -0.62
moons     -0.62
token     -0.62
bye       -0.61
NS        -0.60
together  -0.60
Positive Logits
affer     4.81
antz      1.16
acus      1.03
aff       1.00
pherd     0.95
irlf      0.90
affe      0.88
apter     0.87
afer      0.87
uggage    0.87
Activations Density 0.016%