INDEX
Explanations
phrases related to lifting or removing restrictions
terms related to lifting restrictions or bans
New Auto-Interp
Negative Logits
present
-0.71
lished
-0.70
errilla
-0.70
ãĥ£
-0.68
Palin
-0.64
915
-0.63
Analy
-0.62
RAL
-0.61
TAG
-0.60
clave
-0.60
POSITIVE LOGITS
lift
1.07
weights
1.01
lifted
1.00
lifting
1.00
lifts
0.92
lift
0.88
hens
0.84
ĸļ
0.79
tremend
0.79
weight
0.78
Activations Density 0.017%