INDEX
Explanations
phrases related to surveillance or eavesdropping
terms related to surveillance and monitoring activities
New Auto-Interp
Negative Logits
prayers
-0.71
oil
-0.65
glossy
-0.64
naming
-0.63
Mahjong
-0.62
friendship
-0.62
tongues
-0.61
tun
-0.60
heats
-0.60
grill
-0.60
POSITIVE LOGITS
dropping
2.78
drop
2.46
dro
2.35
drops
1.91
second
1.78
played
1.50
seconds
1.10
perse
1.08
Drop
1.06
conscious
0.95
Activations Density 0.030%