INDEX
Explanations
specific names and terms related to a political or news context
occurrences of specific names and identifiers within the text
New Auto-Interp
Negative Logits
hered
-0.87
ted
-0.77
self
-0.75
TPS
-0.74
sheet
-0.69
ting
-0.69
tf
-0.68
FW
-0.68
Insect
-0.67
ext
-0.66
POSITIVE LOGITS
aji
1.19
pora
1.03
ptions
0.94
ajor
0.92
veyard
0.89
ision
0.86
oti
0.84
Ú
0.84
Angelo
0.84
agara
0.83
Activations Density 0.009%