INDEX
Explanations
Latin and foreign language words or phrases
occurrences of a specific character or symbol
New Auto-Interp
Negative Logits
mathemat
-0.79
disadvant
-0.75
awaru
-0.72
carbohyd
-0.71
soph
-0.71
merce
-0.70
contrace
-0.66
efficients
-0.64
Wichita
-0.64
scholarly
-0.63
POSITIVE LOGITS
ï¸ı
1.15
女
0.91
士
0.84
éĩ
0.79
ï¸
0.77
istg
0.76
çľ
0.76
tu
0.76
æľ
0.75
äº
0.75
Activations Density 0.316%