INDEX
Explanations
instances of the word "It" followed by related statements or phrases
New Auto-Interp
Negative Logits
ãĥ¼ãĥ©
-0.15
ihan
-0.15
ÅĦ
-0.14
lez
-0.14
görmek
-0.13
quina
-0.13
.sul
-0.13
uÅŁ
-0.13
body
-0.13
arness
-0.13
POSITIVE LOGITS
alia
0.21
ching
0.20
alo
0.17
alc
0.17
ald
0.17
ancell
0.16
semb
0.15
seems
0.15
zel
0.14
alien
0.14
Activations Density 0.087%