INDEX
Explanations
mentions of specific symbols or characters
specific symbols or characters that may indicate special formatting or emotional emphasis in text
New Auto-Interp
Negative Logits
unidentified
-0.64
ATM
-0.63
mount
-0.63
Aur
-0.62
intens
-0.62
sund
-0.62
manager
-0.61
mont
-0.61
range
-0.60
olar
-0.60
POSITIVE LOGITS
Ĺ
4.59
ĸ
2.10
ķ
1.82
ĺ
1.81
ļ
1.74
ĥ
1.73
¤
1.72
¦
1.72
ĵ
1.70
§
1.66
Activations Density 0.004%