INDEX
Explanations
instances of the word "eff" or variations of it
variations of the word "effort."
New Auto-Interp
Negative Logits
meal
-0.70
oath
-0.69
²¾
-0.65
Spur
-0.64
shortened
-0.64
SHIP
-0.63
Downloadha
-0.63
BOOK
-0.62
ciating
-0.61
uninterrupted
-0.61
POSITIVE LOGITS
orts
1.50
luent
1.44
erves
1.34
endi
1.25
emin
1.23
usive
1.20
lore
1.19
ort
1.16
iency
1.16
usion
1.15
Activations Density 0.035%