INDEX
Explanations
conjunctions indicating combinations or connections between ideas or entities
New Auto-Interp
Negative Logits
1
-0.58
2
-0.52
if
-0.51
сь
-0.50
.
-0.49
while
-0.48
,
-0.48
croix
-0.47
if
-0.47
ed
-0.46
POSITIVE LOGITS
autorytatywna
1.15
OGND
1.07
Portale
1.04
AssemblyTitle
0.95
Efq
0.94
省市镇
0.94
esterday
0.93
سكانية
0.90
^(@)
0.89
للمعارف
0.88
Activations Density 0.124%