INDEX
Explanations
instances of the word "turn"
references to the concept of "turns" in various contexts
Negative Logits
inately      -0.79
capacity     -0.74
ropolitan    -0.69
ecause       -0.68
hered        -0.67
foundation   -0.66
hammad       -0.66
anyahu       -0.64
mble         -0.64
rator        -0.63
Positive Logits
abouts       0.91
buck         0.87
about        0.83
coat         0.81
edit         0.78
ibility      0.76
shif         0.71
ings         0.69
ip           0.68
aho          0.68
Activations Density 0.019%