INDEX
Explanations
the presence of the special token `<bos>`, which likely indicates the beginning of a segment or section in the text
New Auto-Interp
Negative Logits
([^
-0.84
aktery
-0.82
覆
-0.77
Bradford
-0.77
xss
-0.76
PMS
-0.76
McGrath
-0.75
getConfiguration
-0.74
cila
-0.72
ashe
-0.72
POSITIVE LOGITS
***!
0.96
Roja
0.93
0.91
Gorbachev
0.91
Eilish
0.89
Monfieur
0.88
Rolf
0.87
Tao
0.87
tann
0.86
sap
0.84
Activations Density 0.076%