INDEX
Explanations
proper nouns related to organizations and people
acronyms and abbreviations related to organizations or industries
New Auto-Interp
Negative Logits
catentry
-0.82
iary
-0.70
¯¯¯¯¯¯¯¯
-0.69
bern
-0.67
speedy
-0.66
hold
-0.64
Allaah
-0.64
compan
-0.63
surpr
-0.62
ãĥĥãĥī
-0.62
POSITIVE LOGITS
FU
0.98
SE
0.96
HY
0.96
HS
0.94
KI
0.92
SA
0.91
OP
0.91
OPS
0.89
ESH
0.89
dit
0.89
Activations Density 0.141%