INDEX
Explanations
the term "so" used for emphasis
New Auto-Interp
Negative Logits
713
-0.16
antry
-0.16
ammo
-0.15
Resolution
-0.14
dig
-0.14
Alv
-0.14
Criterion
-0.14
astle
-0.14
Indices
-0.14
NSK
-0.13
POSITIVE LOGITS
aring
0.31
-called
0.27
ho
0.26
jour
0.26
apy
0.25
aking
0.25
Cal
0.25
zial
0.24
Ho
0.24
aps
0.23
Activations Density 0.024%