Explanations
references to physical locations or areas
references to various types of spaces in contexts of safety and accessibility
Negative Logits
Token         Logit
bane          -0.76
razen         -0.70
Browne        -0.67
od            -0.65
OPEC          -0.63
oner          -0.63
cartel        -0.62
hetamine      -0.61
iren          -0.61
mitter        -0.60
Positive Logits
Token         Logit
spaces        3.92
Spaces        2.59
space         2.44
space         1.93
paces         1.68
SPACE         1.68
Space         1.56
rooms         1.44
spac          1.44
environments  1.44
Activations Density 0.016%