INDEX
Explanations
phrases and words related to physical boundaries or limitations
New Auto-Interp
Negative Logits
бо
-0.15
ÙĪØ±Ø²
-0.14
иÑĢов
-0.14
maxlength
-0.14
angular
-0.14
heimer
-0.14
ibble
-0.14
asio
-0.14
WARDED
-0.14
ì§Ģ를
-0.13
POSITIVE LOGITS
pin
0.39
pins
0.32
pin
0.32
pins
0.29
pinned
0.28
lie
0.27
-pin
0.26
Pin
0.26
Pin
0.26
SCORE
0.25
Activations Density 0.018%