INDEX
Explanations
reflexive pronouns commonly used when describing actions related to accusations or blame
references to accusations or claims against individuals
New Auto-Interp
Negative Logits
NOR
-0.65
endor
-0.64
WOR
-0.55
nov
-0.53
atform
-0.52
Brav
-0.52
adolesc
-0.52
nan
-0.52
Cowboy
-0.51
iants
-0.51
POSITIVE LOGITS
of
1.28
of
1.27
thereof
1.06
Of
1.05
Of
0.95
OF
0.92
OF
0.87
ta
0.72
ensor
0.65
oft
0.62
Activations Density 0.360%