INDEX
Explanations
proper nouns related to places, names, and organizations
New Auto-Interp
Negative Logits
izoph
-0.70
ADRA
-0.66
Tokens
-0.62
Scotland
-0.60
overwhelming
-0.60
Xi
-0.60
Compared
-0.59
prohib
-0.58
apex
-0.58
unfavorable
-0.58
POSITIVE LOGITS
Jr
1.54
III
1.20
Sr
1.16
berger
1.11
baum
1.11
oglu
1.04
owski
1.03
iewicz
1.03
QC
1.03
ovich
1.03
Activations Density 1.032%