INDEX
Explanations
the word "ELL" at high activation levels
occurrences of the word "yellowstone" and its variations, along with mentions of "tut" and "stem"
New Auto-Interp
Negative Logits
ãĥ³ãĤ¸
-0.82
iously
-0.70
ance
-0.70
ious
-0.69
Bis
-0.65
owship
-0.64
atus
-0.63
Bene
-0.63
bis
-0.62
ateur
-0.61
POSITIVE LOGITS
ELL
4.21
tut
1.42
EAR
1.04
tutor
1.03
alam
1.01
tty
0.92
stem
0.90
EV
0.83
ugi
0.82
elli
0.79
Activations Density 0.037%