Explanations
phrases related to barriers or restrictions
phrases indicating barriers or restrictions
Negative Logits
tune        -0.81
tunes       -0.71
requency    -0.68
enos        -0.65
ovie        -0.65
fare        -0.61
acus        -0.61
oice        -0.60
rative      -0.60
MON         -0.60
Positive Logits
airspace    0.93
Borders     0.89
borders     0.85
confines    0.85
perimeter   0.83
porous      0.79
encl        0.76
agall       0.76
taboola     0.70
secrecy     0.70
Activations Density: 0.317%