INDEX
Explanations
repeated mentions of a specific character named Don
Don followed by names
Negative Logits
queryInterface     -0.62
uxxxx              -0.59
urtz               -0.56
OGND               -0.54
Vieira             -0.54
RTGC               -0.54
日閲覧             -0.53
atalos             -0.52
Tapia              -0.51
TokenNameLPAREN    -0.51
Positive Logits
Don                 1.63
Don                 1.47
DON                 0.89
Дон                 0.82
Dono                0.71
Dons                0.68
Donald              0.68
Doesn               0.65
don                 0.63
Dont                0.63
Activations Density 0.005%
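
To make the density figure concrete, here is a minimal sketch that assumes "activation density" means the fraction of token positions on which the feature fires at all (activation above zero). That definition, the function name `activation_density`, and the sample data are illustrative assumptions, not something this page states.

```python
def activation_density(activations, threshold=0.0):
    """Fraction of token positions whose activation exceeds `threshold`.

    Assumed definition: the page does not say how density is computed.
    """
    if not activations:
        raise ValueError("activations must be non-empty")
    active = sum(1 for a in activations if a > threshold)
    return active / len(activations)


# Hypothetical data: 100,000 token positions, 5 of which activate the feature.
acts = [0.0] * 100_000
for i in (10, 2_000, 30_000, 45_000, 99_999):
    acts[i] = 1.5

density = activation_density(acts)
print(f"{density:.3%}")  # prints "0.005%"
```

Under that assumption, a density of 0.005% corresponds to roughly 5 active positions per 100,000 tokens, which fits a feature that responds only to a narrow family of tokens.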