Explanations
triggers related to exclamations and emotional expressions
repeated characters or sequences that indicate a formatting or encoding issue
Negative Logits
Token      Logit
Shant      -0.70
Tid        -0.70
photoc     -0.69
mete       -0.67
Shap       -0.65
seiz       -0.64
Xan        -0.64
horizont   -0.63
Synd       -0.63
Drawn      -0.61
Positive Logits
Token      Logit
ķ          1.15
«          1.11
Ŀ          1.06
Ĵ          1.04
Ń          1.04
ĸ          1.03
¬          1.03
´          1.03
ĵ          1.02
ª          1.02
Activations Density 0.118%
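
The density figure above can be read as the share of tokens on which the feature fires. The snippet below is a minimal sketch of that calculation, assuming density means "tokens with activation above a threshold, divided by all tokens"; the function name `activation_density`, the zero threshold, and the sample data are all hypothetical and not taken from the dashboard.

```python
def activation_density(activations, threshold=0.0):
    """Return the fraction of tokens whose activation exceeds `threshold`.

    Assumption: this mirrors how a dashboard might define density;
    the actual definition used by the tool is not stated in the source.
    """
    if not activations:
        raise ValueError("activations must be non-empty")
    active = sum(1 for a in activations if a > threshold)
    return active / len(activations)


# Hypothetical data: 10,000 tokens, 12 of which activate the feature.
acts = [0.0] * 10_000
for i in range(12):
    acts[i * 800] = 1.5

density = activation_density(acts)
print(f"{density:.3%}")
```

A value such as 0.118% would then correspond to roughly one activating token in every 850.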