INDEX
Explanations
phrases related to legal or political discussions about detention and security
Punctuation followed by specific words
English words and punctuation
Negative Logits
(~        -0.70
(&        -0.68
(‘        -0.66
(         -0.66
🙂        -0.64
Whilst    -0.63
xD        -0.63
Whilst    -0.59
:)        -0.59
fuck      -0.58
Positive Logits
isolé        0.79
ieri         0.71
livré        0.70
مرئيه        0.70
>>           0.69
scattata     0.69
VIDEOTAPE    0.68
♪            0.67
morire       0.66
gruesa       0.66
Activations Density 0.026%