INDEX
Explanations
references to rebellion against authority or oppressive regimes
New Auto-Interp
Negative Logits
LEC
-0.15
jadx
-0.15
863
-0.15
862
-0.15
linky
-0.14
acements
-0.14
rost
-0.14
cete
-0.14
izzo
-0.14
stagger
-0.14
POSITIVE LOGITS
Lot
0.17
town
0.16
lot
0.15
neighbours
0.15
towns
0.15
Lottery
0.15
narr
0.15
-town
0.14
Everybody
0.14
彩票
0.14
Activations Density 0.000%