INDEX
Explanations
words related to voting
occurrences of the token "ote."
New Auto-Interp
Negative Logits
slow
-0.69
Slow
-0.64
uninsured
-0.64
uneven
-0.60
Furious
-0.60
unexplained
-0.60
frustrations
-0.59
Lat
-0.59
riots
-0.59
divorce
-0.59
POSITIVE LOGITS
ote
4.53
otes
2.98
OTE
2.00
oting
1.89
oted
1.81
otal
1.58
ota
1.53
otic
1.45
ot
1.40
oters
1.31
Activations Density 0.010%