INDEX
Explanations
mentions of specific organizations or political entities
references to journalistic content and related terms
New Auto-Interp
Negative Logits
DU
-0.83
Bren
-0.81
DRAG
-0.80
brid
-0.79
DAC
-0.76
Ellen
-0.76
EAR
-0.76
CY
-0.76
Brid
-0.75
Compos
-0.74
POSITIVE LOGITS
ist
1.46
ists
1.36
ism
1.29
IST
1.20
iste
1.13
istic
1.08
ista
1.06
ISM
1.00
ismo
1.00
istani
0.98
Activations Density 0.264%