Explanations
mentions of wheeled objects
mentions of specific names and terms related to governance and politics
Negative Logits
crawl     -0.87
Ops       -0.75
Lumpur    -0.75
ized      -0.68
ocular    -0.67
aminer    -0.67
gallery   -0.66
izers     -0.65
ops       -0.64
drawer    -0.64
Positive Logits
er        1.10
ership    0.99
eling     0.93
elines    0.90
esome     0.90
ered      0.90
ences     0.90
pton      0.88
arse      0.87
ance      0.87
Activations Density: 0.074%