INDEX
Explanations
mentions of specific people and organizations, potentially related to biographies or political statements
the names of people or entities associated with specific contexts or events
New Auto-Interp
Negative Logits
occasion
-0.80
Cher
-0.73
Atlas
-0.71
direction
-0.71
Gran
-0.70
Aux
-0.68
Mus
-0.68
Evangel
-0.68
reserves
-0.68
Civil
-0.67
POSITIVE LOGITS
Bi
2.28
Po
2.13
Ho
2.11
Va
1.96
Ma
1.85
Lo
1.66
Li
1.64
Ha
1.63
Hu
1.63
Sa
1.56
Activations Density 0.043%