INDEX
Explanations
the personal pronoun "it" followed by verbs or verb phrases
the pronoun "it" used to refer to entities in various contexts
Negative Logits
Anarchy     -0.72
Breast      -0.69
Toast       -0.67
Orn         -0.64
Priv        -0.64
Socket      -0.63
ppa         -0.63
Torn        -0.63
Flavoring   -0.62
Warm        -0.62
Positive Logits
alian       1.12
self        1.06
asca        1.01
unes        0.95
chy         0.90
henko       0.87
beh         0.79
ueller      0.78
chwitz      0.77
ÃĥÃĤ        0.77
Activations Density 0.374%