INDEX
Explanations
the word "by" indicating action or agency in sentences
New Auto-Interp
Negative Logits
voj
-0.06
allen
-0.06
ussen
-0.06
nts
-0.06
aight
-0.06
ught
-0.06
æĺ¯åIJ¦
-0.06
isize
-0.06
ughter
-0.06
izoph
-0.06
POSITIVE LOGITS
edException
0.08
opic
0.07
ared
0.07
alion
0.07
iba
0.06
/slick
0.06
PEC
0.06
rost
0.06
olan
0.06
idi
0.06
Activations Density 0.054%