INDEX
Explanations
instances of the word "for" and related prepositions indicating purpose or reason
New Auto-Interp
Negative Logits
ippy
-0.16
ãĥ¥ãĥ¼
-0.16
elho
-0.15
ilon
-0.14
olec
-0.14
oxel
-0.14
çĿĽ
-0.14
ÙİØ§ÙĨ
-0.14
_ISS
-0.14
ulle
-0.14
POSITIVE LOGITS
angler
0.17
veau
0.14
scal
0.14
AMY
0.14
rib
0.14
table
0.14
assy
0.14
fo
0.14
aq
0.14
trib
0.14
Activations Density 0.010%