INDEX
Explanations
the pronoun "he" in various contexts
Negative Logits
ال               -0.65
Gems             -0.64
Interest         -0.64
pregn            -0.62
Veter            -0.62
Royale           -0.61
Suzanne          -0.59
DIV              -0.58
internationally  -0.57
Interested       -0.57
Positive Logits
eded             1.32
eding            1.26
ctor             1.17
aped             1.13
zbollah          1.09
aving            1.09
aps              1.06
uristic          1.06
ctic             1.05
aping            1.02
Activations Density 0.156%
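
The density figure can be read as the share of sampled tokens on which the feature fires at all. The sketch below illustrates that reading under stated assumptions: the function name `activation_density`, the zero threshold, and the sample data are all hypothetical, and the dashboard's actual sampling procedure is not given here.

```python
def activation_density(activations, threshold=0.0):
    """Return the fraction of tokens whose activation exceeds `threshold`.

    Assumption: "density" means the share of sampled tokens on which the
    feature is active (activation > 0). The source does not define it.
    """
    if not activations:
        return 0.0
    active = sum(1 for a in activations if a > threshold)
    return active / len(activations)


# Hypothetical sample: 100,000 tokens, with the feature active on 156 of them.
sample = [0.0] * 100_000
for i in range(156):
    sample[i * 641] = 1.5  # arbitrary nonzero activation value

density = activation_density(sample)
print(f"{density:.3%}")
```

A density this low would mean the feature is sparse: it is silent on the vast majority of tokens and fires only on a narrow set of contexts.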