Explanations
keywords related to interrogations
terms related to interrogation and questioning processes
Negative Logits
Token       Logit
Offline     -0.73
spare       -0.67
minecraft   -0.66
ulz         -0.65
aim         -0.63
fortune     -0.63
cakes       -0.62
jri         -0.62
buy         -0.62
ensical     -0.62
Positive Logits
Token           Logit
interrog        1.23
interrogation   1.17
interrogated    1.01
Techniques      0.86
techniques      0.84
questioning     0.81
isen            0.77
probing         0.76
torture         0.76
tactics         0.74
Activations Density 0.018%