INDEX
Explanations
phrases related to legal and political controversies
Negative Logits
erest      -0.62
mum        -0.57
ling       -0.56
mall       -0.55
level      -0.54
jured      -0.54
comfort    -0.53
ricular    -0.53
earch      -0.52
lull       -0.52
Positive Logits
including       1.07
which           1.03
something       1.02
meaning         0.97
particularly    0.97
perhaps         0.96
especially      0.95
namely          0.94
possibly        0.93
even            0.93
Activations Density 6.879%