INDEX
Explanations
references to physical structures and barriers, especially in the context of confinement or protection
Negative Logits
квартира        -0.34
ujednoznacz     -0.33
icônes          -0.33
enterOuterAlt   -0.33
geber           -0.33
rasser          -0.33
Crusoe          -0.33
Dorf            -0.33
ressible        -0.32
➋               -0.32
Positive Logits
fence           2.14
fences          2.00
Fence           1.81
fencing         1.78
fence           1.73
Fence           1.64
gates           1.49
Fencing         1.49
barrier         1.47
gate            1.45
Activations Density: 0.250%