INDEX
Explanations
the word "shadow" followed by another word
references to "shadow" roles in a political context
Negative Logits
urses      -0.86
artney     -0.85
anchester  -0.85
ickr       -0.83
keye       -0.79
awaru      -0.78
renheit    -0.76
otide      -0.76
ourse      -0.71
ancock     -0.71
Positive Logits
moon       1.02
boxing     0.97
flame      0.89
loo        0.86
runners    0.83
hawk       0.81
shadow     0.80
Shadow     0.80
shadow     0.76
lights     0.76
Activations Density 0.025%