Explanations
phrases directed at the reader with a varying overall tone
references to the second person, specifically addressing the reader or listener directly
Negative Logits
ice         -0.73
assembly    -0.70
icy         -0.69
¿½          -0.67
ftime       -0.64
î           -0.63
Samoa       -0.61
airs        -0.60
UTC         -0.60
Commerce    -0.59
Positive Logits
're         1.52
've         1.27
'll         1.24
'd          1.07
guys        1.00
tub         1.00
owe         0.91
know        0.90
gotta       0.88
RS          0.87
Activations Density: 0.260%