INDEX
Explanations
phrases related to barriers and obstacles
New Auto-Interp
Negative Logits
allet
-0.15
SCAN
-0.15
asket
-0.14
isd
-0.14
venes
-0.14
iske
-0.13
åľŃ
-0.13
onth
-0.13
å£
-0.13
Horton
-0.13
POSITIVE LOGITS
ikal
0.15
/bar
0.15
imped
0.15
barriers
0.15
/block
0.15
113
0.15
578
0.15
343
0.14
obstacles
0.14
:block
0.14
Activations Density 0.066%