INDEX
Explanations
political terms or phrases
terms related to political contexts or motivations
New Auto-Interp
Negative Logits
Chiefs
-0.78
Vikings
-0.74
Rams
-0.73
Seasons
-0.72
Jackets
-0.71
Isle
-0.70
Cannes
-0.70
Royals
-0.70
Rune
-0.70
Viking
-0.68
POSITIVE LOGITS
speaking
0.89
correct
0.89
motivated
0.81
affili
0.81
incorrect
0.81
minded
0.80
handic
0.79
speaking
0.75
engineered
0.74
advant
0.73
Activations Density 0.007%