INDEX
Explanations
words related to physical barriers or limitations
occurrences of the word "gate" and its context
New Auto-Interp
Negative Logits
ortium
-0.95
Hots
-0.83
ensional
-0.80
enegger
-0.79
issance
-0.74
lihood
-0.73
encers
-0.67
inho
-0.67
TING
-0.66
ocker
-0.66
POSITIVE LOGITS
keepers
1.37
keeper
1.32
ways
1.12
keeping
1.04
posts
1.01
gates
1.01
stones
0.98
house
0.97
gate
0.93
staff
0.90
Activations Density 0.016%