INDEX
Explanations
the pronoun "it" in various contexts
New Auto-Interp
Negative Logits
emen
-0.17
itto
-0.16
sure
-0.16
óz
-0.15
STRACT
-0.15
-Ñı
-0.14
глÑı
-0.14
گرÛĮ
-0.14
Needed
-0.14
κοÏģ
-0.14
POSITIVE LOGITS
beh
0.34
pays
0.28
important
0.27
pay
0.23
Pay
0.23
important
0.22
helps
0.21
vital
0.21
Pays
0.20
Pay
0.20
Activations Density 0.116%