INDEX
Explanations
phrases indicating appreciation or evoking curiosity about potential outcomes
New Auto-Interp
Negative Logits
when
-0.61
When
-0.55
M
-0.52
-0.51
P
-0.49
FROM
-0.48
F
-0.47
M
-0.47
would
-0.47
Even
-0.46
POSITIVE LOGITS
مرئيه
1.05
InjectAttribute
1.05
للاسماء
1.00
存于互联网档案馆
0.96
تانيه
0.93
ویکیپدی
0.93
виправивши
0.92
ThroughAttribute
0.92
ISTAT
0.90
Normdatei
0.89
Activations Density 0.207%