INDEX
Explanations
phrases related to origins or causes of issues
Negative Logits
taker        -0.63
mouth        -0.63
dogs         -0.62
pads         -0.59
bowls        -0.59
monitors     -0.58
Breaker      -0.57
/-           -0.57
tumble       -0.57
thumbs       -0.56
Positive Logits
partly       0.96
rooted       0.86
unden        0.83
chiefly      0.83
stems        0.78
principally  0.77
traced       0.77
manifold     0.76
tymology     0.75
origins      0.75
Activations Density 0.191%