INDEX
Explanations
numerical values related to specific content or information like article IDs or dates
numeric identifiers or codes
Negative Logits
ierrez     -0.82
oulos      -0.77
brace      -0.76
aughs      -0.71
anooga     -0.70
chio       -0.65
DRAG       -0.65
nomine     -0.64
Beir       -0.63
Combine    -0.63
POSITIVE LOGITS
88         0.98
66         0.97
646        0.96
802        0.96
003        0.96
014        0.96
69         0.96
201        0.95
67         0.95
89         0.95
Activations Density 0.099%
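
The page does not define how the density figure is computed. A minimal sketch, assuming it is the fraction of sampled tokens on which the feature's activation is above zero; the function name `activation_density`, the `threshold` parameter, and the sample data are all hypothetical and chosen only for illustration.

```python
def activation_density(activations, threshold=0.0):
    """Return the fraction of tokens whose activation exceeds `threshold`.

    Assumption: "density" means the share of tokens where the feature fires.
    """
    if not activations:
        return 0.0
    active = sum(1 for a in activations if a > threshold)
    return active / len(activations)


# Hypothetical sample: the feature fires on 1 token out of 1000.
sample = [0.0] * 999 + [3.2]
density = activation_density(sample)
print(f"{density:.3%}")
```

Under this reading, a density of 0.099% would mean the feature fires on roughly one token in a thousand, which fits a feature that responds mainly to numeric tokens.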