INDEX
Explanations
phrases expressing strong emotions or judgments toward others
second-person pronouns emphasizing direct address or accusations toward the reader
Negative Logits
icy
-0.70
ice
-0.67
¿½
-0.66
ftime
-0.65
assembly
-0.64
airs
-0.64
FEC
-0.60
UTC
-0.60
acular
-0.59
Above
-0.58
Positive Logits
're         1.57
've         1.34
'll         1.31
'd          1.16
tub         1.13
guys        1.08
know        0.92
owe         0.91
guessed     0.89
RS          0.89
Activations Density 0.240%