INDEX
Explanations
instances of the word "platform"
references to political platforms
New Auto-Interp
Negative Logits
ospital
-0.68
idan
-0.66
risome
-0.63
Ravens
-0.62
utenberg
-0.61
umph
-0.61
nexus
-0.61
ENE
-0.61
Kard
-0.60
ude
-0.59
POSITIVE LOGITS
plank
0.95
speeches
0.88
etter
0.83
platform
0.82
holder
0.81
ngth
0.78
holders
0.77
onite
0.76
manship
0.76
eering
0.75
Activations Density 0.009%