INDEX
Explanations
phrases or words related to variations or differences
Negative Logits
DEF          -0.76
ジ           -0.74
iron         -0.69
NN           -0.67
版           -0.66
ainment      -0.66
lest         -0.65
Otherwise    -0.64
Li           -0.63
WARN         -0.63
Positive Logits
differing         1.21
different         1.14
varying           1.10
depending         1.09
severity          1.01
geographically    0.99
variations        0.98
different         0.97
variation         0.95
geographic        0.93
Activations Density 0.250%