INDEX
Explanations
phrases related to legal terms and restrictions
generic terms and phrases related to permissions and limitations
New Auto-Interp
Negative Logits
rea
-0.77
rex
-0.75
ÃŁ
-0.73
rez
-0.72
plex
-0.67
staking
-0.66
seless
-0.65
ros
-0.64
eps
-0.63
expensive
-0.63
POSITIVE LOGITS
THING
1.20
WHERE
1.07
conceivable
1.01
particular
0.95
other
0.93
body
0.93
ONE
0.91
sort
0.83
where
0.82
kind
0.80
Activations Density 0.082%