INDEX
Explanations
auxiliary verbs and pronouns
Negative Logits
Token        Logit
ear          -0.15
inkel        -0.15
Ear          -0.15
pii          -0.14
ĵĺ           -0.14
Witness      -0.13
гаран        -0.13
obili        -0.13
dishonest    -0.13
anske        -0.13
Positive Logits
Token          Logit
expectation    0.19
jang           0.18
expectations   0.18
expect         0.17
Expect         0.17
originally     0.16
expecting      0.16
actually       0.16
expect         0.15
Expect         0.15
Activations Density 0.008%