INDEX
Explanations
dates or events represented in the format Day/Month
occurrences of a specific symbol or character throughout the text
New Auto-Interp
Negative Logits
constitu
-0.85
imitation
-0.83
disadvant
-0.83
vulner
-0.74
mathemat
-0.73
carbohyd
-0.73
blender
-0.72
promoters
-0.72
tangled
-0.68
levers
-0.68
POSITIVE LOGITS
ï¸ı
1.21
ï¸
1.00
Ļ
0.91
İ
0.88
ħ
0.87
ا
0.85
女
0.84
ATH
0.83
PT
0.81
ļ
0.81
Activations Density 0.458%