INDEX
Explanations
phrases describing an extreme level or limit of something
phrases that indicate reaching an extreme situation or outcome
Negative Logits
Token       Logit
irm         -0.74
rounder     -0.69
annis       -0.67
thia        -0.66
ellow       -0.64
beit        -0.62
avorite     -0.62
irmed       -0.61
anus        -0.61
olly        -0.61
Positive Logits
Token       Logit
where       0.80
liest       0.79
lessness    0.77
absurdity   0.75
brink       0.75
exhaustion  0.71
points      0.70
verge       0.70
point       0.69
ophys       0.69
Activations Density: 0.029%