INDEX
Explanations
expressions related to holiday greetings and well-wishes
specific formatting or special characters in the text
Negative Logits
targeted      -0.71
permitted     -0.69
advisory      -0.69
delegation    -0.69
proposed      -0.68
Wilmington    -0.68
Libyan        -0.68
narrowly      -0.68
succession    -0.67
Tripoli       -0.67
Positive Logits
ï¸ı      1.60
ðŁĺ      1.13
¯        1.13
âĻ       1.11
âĿ       1.07
Ô        1.07
ðŁ       1.01
âĶģ      1.01
shit     0.98
ï¸       0.97
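Several of the positive-logit tokens above look like encoding debris but are probably byte-level BPE token strings, where each character stands for one raw byte. The sketch below assumes the tokenizer uses the GPT-2 byte-to-character table (the page does not say which tokenizer is in use, so this is an assumption), and the helper names are illustrative. It shows how such a token string could be mapped back to bytes.

```python
def bytes_to_unicode():
    """Rebuild the GPT-2 style byte -> printable-character table (assumed here)."""
    # Bytes that are already printable map to themselves.
    bs = (
        list(range(0x21, 0x7F))      # '!' .. '~'
        + list(range(0xA1, 0xAD))    # U+00A1 .. U+00AC
        + list(range(0xAE, 0x100))   # U+00AE .. U+00FF
    )
    cs = bs[:]
    n = 0
    # Every remaining byte is shifted to code point 256 + n.
    for b in range(256):
        if b not in bs:
            bs.append(b)
            cs.append(256 + n)
            n += 1
    return dict(zip(bs, map(chr, cs)))


# Inverse table: displayed character -> raw byte value.
UNICODE_TO_BYTE = {ch: b for b, ch in bytes_to_unicode().items()}


def token_to_bytes(token: str) -> bytes:
    """Convert a displayed byte-level token string back to its raw bytes."""
    return bytes(UNICODE_TO_BYTE[ch] for ch in token)


# The top positive token, written with escapes (U+00EF, U+00B8, U+0131).
top_token = "\u00ef\u00b8\u0131"
raw = token_to_bytes(top_token)
print(raw)                   # b'\xef\xb8\x8f'
print(raw.decode("utf-8"))   # U+FE0F, the emoji variation selector

# A partial emoji prefix (U+00F0, U+0141, U+013A); incomplete UTF-8 on its own.
print(token_to_bytes("\u00f0\u0141\u013a"))  # b'\xf0\x9f\x98'
```

Under that assumption, the top token decodes to the emoji variation selector and several others are leading bytes of emoji or symbol code points, which fits the greeting and special-character explanations listed above.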
Activations Density 0.281%