Explanations
mentions of the word "af"
repeated mentions of the suffix "af"
Negative Logits
Olymp        -0.67
Dru          -0.67
fixation     -0.66
Boone        -0.65
Hulk         -0.65
Wink         -0.64
MLG          -0.63
Creator      -0.63
Patriarch    -0.62
paraly       -0.60
Positive Logits
rican         1.41
rica          1.30
ayette        1.21
avorite       1.11
amily         1.10
rost          1.06
ornia         1.04
eatures       1.04
raid          1.04
onso          1.04
Activations Density 0.016%