INDEX
Explanations
occurrences of the word "in" and its various forms
Negative Logits
ume       -0.15
anga      -0.15
ering     -0.14
Ë         -0.14
рещ       -0.13
ウト      -0.13
arga      -0.13
高清      -0.13
ard       -0.13
mas       -0.13
Positive Logits
remarks       0.27
statements    0.27
comments      0.26
statement     0.23
interviews    0.22
separate      0.20
response      0.20
entrev        0.20
remarks       0.19
statements    0.19
Activations Density: 0.093%