INDEX
Explanations
phrases related to being caught in a negative or incriminating situation
instances of the word "caught"
New Auto-Interp
Negative Logits
issance
-0.65
study
-0.65
ukong
-0.64
guided
-0.64
olas
-0.64
VERTISEMENT
-0.61
enthal
-0.61
ceremonies
-0.61
profits
-0.60
urai
-0.59
POSITIVE LOGITS
unprepared
1.13
cheating
0.91
plagiar
0.91
stealing
0.91
tampering
0.90
unaware
0.90
violating
0.75
sneaking
0.74
gling
0.74
dere
0.74
Activations Density 0.026%