INDEX
Explanations
terms related to entry or access points
references to "gate" in various contexts
New Auto-Interp
Negative Logits
Hots
-0.81
ortium
-0.74
TING
-0.70
arus
-0.69
ity
-0.68
lihood
-0.67
ensional
-0.66
issance
-0.65
ynasty
-0.65
ied
-0.64
POSITIVE LOGITS
keeper
1.25
keepers
1.24
ways
1.20
posts
0.98
way
0.97
fold
0.94
keeping
0.90
stones
0.89
house
0.87
hole
0.83
Activations Density 0.030%