INDEX
Explanations
mentions of specific animals, particularly turtles, and related terms
references to turtles and related themes
New Auto-Interp
Negative Logits
Interstitial
-0.90
ãĥ¼ãĥĨ
-0.80
âĸ¬
-0.73
ãĥ¼ãĥ³
-0.70
×Ļ×
-0.68
heimer
-0.68
selection
-0.66
Null
-0.64
Dull
-0.64
Ö¼
-0.63
POSITIVE LOGITS
turtle
1.23
turtles
1.23
Turtles
1.07
brates
1.00
Turtle
0.99
brate
0.99
urtle
0.91
urtles
0.88
pole
0.85
face
0.84
Activations Density 0.012%