INDEX
Explanations
patterns that start with special characters and involve a sense or perception
special characters or glyphs that may indicate a certain tone or emphasis in the text
Negative Logits
disse       -0.79
seiz        -0.79
ãĥ¯ãĥ³      -0.76
snail       -0.71
obser       -0.71
Franch      -0.70
scor        -0.68
hemor       -0.65
dehuman     -0.64
icit        -0.63
Positive Logits
Ŀ           1.58
¡           1.19
Ĵ           1.01
ľ           0.98
ī           0.98
¤           0.97
Ĩ           0.95
Ķ           0.94
¦           0.94
ĺ           0.92
Activations Density 0.299%
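
The glyph-like tokens in the logit lists (for example Ŀ, ¡, and ãĥ¯ãĥ³) look like the printable stand-ins that GPT-2-style byte-level BPE vocabularies use for raw bytes. The page does not say which tokenizer produced them, so treat that as an assumption. Under it, a minimal sketch for mapping such a token back to its underlying bytes could look like this; `token_bytes` and `decode_token` are hypothetical helper names introduced here for illustration.

```python
def bytes_to_unicode():
    """Byte -> printable character table used by GPT-2-style byte-level BPE.

    ASSUMPTION: the dashboard's tokens come from a vocabulary built with
    this table; the source page does not confirm the tokenizer.
    """
    # Bytes that are already printable keep their own code point.
    bs = (
        list(range(ord("!"), ord("~") + 1))
        + list(range(ord("¡"), ord("¬") + 1))
        + list(range(ord("®"), ord("ÿ") + 1))
    )
    cs = bs[:]
    n = 0
    # Every remaining byte is shifted to a code point at 256 and above.
    for b in range(256):
        if b not in bs:
            bs.append(b)
            cs.append(256 + n)
            n += 1
    return dict(zip(bs, (chr(c) for c in cs)))


# Inverse table: printable character -> original byte value.
UNICODE_TO_BYTE = {ch: b for b, ch in bytes_to_unicode().items()}


def token_bytes(token: str) -> bytes:
    """Return the raw bytes a displayed token stands for (hypothetical helper)."""
    return bytes(UNICODE_TO_BYTE[ch] for ch in token)


def decode_token(token: str) -> str:
    """Decode a displayed token as UTF-8, replacing incomplete sequences."""
    return token_bytes(token).decode("utf-8", errors="replace")
```

Under this assumption, `decode_token("ãĥ¯ãĥ³")` gives the katakana string ワン, while `token_bytes("Ŀ")` gives the single byte `0x9d`. That byte is a UTF-8 continuation byte, so it cannot be decoded on its own. This would fit the explanations above: the top positive logits would then be fragments of multi-byte characters rather than standalone symbols.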