INDEX
Explanations
words related to physical boundaries or limits
terms related to boundaries or limits
New Auto-Interp
Negative Logits
orah
-0.84
ondo
-0.81
jury
-0.78
fortune
-0.77
ocker
-0.77
enza
-0.74
ulz
-0.73
kus
-0.73
obe
-0.73
milo
-0.71
POSITIVE LOGITS
boundaries
1.39
boundary
1.22
Bound
0.90
markers
0.86
barriers
0.75
bounds
0.75
Limits
0.75
thresholds
0.72
delim
0.72
gates
0.72
Activations Density 0.011%