INDEX
Explanations
phrases related to tutorials or mentoring experiences
repeated mentions of the word "tut" and its variants, indicating references to tutorials
Negative Logits
Token       Logit
saline      -0.82
captivity   -0.75
士          -0.68
mine        -0.65
ISSION      -0.65
Lima        -0.64
Unknown     -0.63
Franch      -0.63
theless     -0.61
oples       -0.61
Positive Logits
Token       Logit
tle         1.23
tut         1.14
icket       1.03
te          1.01
ickets      0.97
ypes        0.96
anium       0.95
Tut         0.95
ilities     0.94
tops        0.93
Activations Density: 0.025%