INDEX
Explanations
phrases or sentences indicating approval or authorization
the term "signed off" in various contexts, indicating approval or agreement
New Auto-Interp
Negative Logits
liter
-0.69
jad
-0.67
Rowling
-0.66
anan
-0.65
Ung
-0.62
Faul
-0.61
istor
-0.61
oved
-0.61
ophile
-0.60
Gall
-0.60
POSITIVE LOGITS
cffffcc
0.84
shoot
0.75
MENTS
0.73
tank
0.71
rection
0.71
ussions
0.69
¯¯
0.68
WARNING
0.67
points
0.63
igious
0.63
Activations Density 0.039%