INDEX
Explanations
instances of the word "turn" and its variations in different contexts
New Auto-Interp
Negative Logits
ſche
-0.79
pleaſure
-0.79
IntoConstraints
-0.71
niyang
-0.70
leſs
-0.69
houſe
-0.66
hasMoreElements
-0.66
ainfi
-0.66
doubtnut
-0.65
richi
-0.65
POSITIVE LOGITS
Turn
0.85
arounds
0.82
Turn
0.82
TURN
0.81
TURN
0.78
turns
0.78
around
0.77
Turning
0.77
turn
0.76
upside
0.75
Activations Density 0.075%