INDEX
Explanations
words related to political figures or entities
mentions of "legislation" or related terms
Negative Logits
âķIJâķIJ    -0.91
Kindle    -0.71
à¨        -0.66
hower     -0.65
aukee     -0.63
Learns    -0.63
ãģ¦       -0.62
ciation   -0.61
AIR       -0.61
ï¸        -0.61
Positive Logits
itimate   1.33
isl       1.28
Leg       1.10
uin       1.05
acies     1.04
Leg       1.04
leg       0.93
acy       0.92
lore      0.88
busters   0.84
Activations Density 0.011%
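
Some tokens in the logit lists above (âķIJâķIJ, à¨, ãģ¦, ï¸) look like encoding debris, but they are consistent with the printable stand-ins that GPT-2 style byte-level BPE tokenizers use for raw bytes. The sketch below rebuilds that byte-to-character table and inverts it to recover the underlying text. It assumes the model behind this page uses such a tokenizer, which the page itself does not state; the names `bytes_to_unicode`, `CHAR_TO_BYTE`, and `decode_token` are illustrative, not part of any dashboard API.

```python
def bytes_to_unicode():
    """Rebuild the byte -> printable-character table of GPT-2 style byte-level BPE."""
    # Bytes that are already printable map to themselves.
    bs = (list(range(ord("!"), ord("~") + 1))
          + list(range(ord("¡"), ord("¬") + 1))
          + list(range(ord("®"), ord("ÿ") + 1)))
    cs = bs[:]
    n = 0
    # Every remaining byte is assigned the next code point starting at 256.
    for b in range(256):
        if b not in bs:
            bs.append(b)
            cs.append(256 + n)
            n += 1
    return dict(zip(bs, (chr(c) for c in cs)))


# Inverse table: displayed character -> original byte value.
CHAR_TO_BYTE = {ch: b for b, ch in bytes_to_unicode().items()}


def decode_token(token):
    """Map a displayed token string back to UTF-8 text (assumption: byte-level BPE).

    Incomplete byte sequences are rendered as U+FFFD rather than raising.
    Raises KeyError for characters outside the table, such as a literal space.
    """
    raw = bytes(CHAR_TO_BYTE[ch] for ch in token)
    return raw.decode("utf-8", errors="replace")


if __name__ == "__main__":
    for tok in ["âķIJâķIJ", "ãģ¦", "à¨", "ï¸", "Kindle"]:
        print(repr(tok), "->", repr(decode_token(tok)))
```

Under this assumption, âķIJâķIJ decodes to the box-drawing pair ══ and ãģ¦ to the hiragana て, while à¨ and ï¸ are incomplete UTF-8 prefixes that only form a character together with a following token. Plain ASCII tokens such as Kindle pass through unchanged.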