INDEX
Explanations
proper nouns related to political figures or organizations
the presence of empty or non-informative content
New Auto-Interp
Negative Logits
Andersen
-0.62
example
-0.58
ener
-0.56
schild
-0.56
Ïī
-0.56
Lone
-0.55
Siber
-0.55
bec
-0.55
phy
-0.55
blers
-0.54
POSITIVE LOGITS
lawmakers
0.83
lawmaker
0.81
BJP
0.80
deputy
0.79
officials
0.77
Secretary
0.77
approves
0.76
secretary
0.74
Says
0.74
accuses
0.74
Activations Density 0.340%