INDEX
Explanations
references to high quality across various contexts
New Auto-Interp
Negative Logits
lang
-0.15
Pill
-0.14
heid
-0.14
aira
-0.14
ElementException
-0.13
oru
-0.13
-pill
-0.13
aus
-0.13
kim
-0.13
delayed
-0.13
POSITIVE LOGITS
eum
0.19
753
0.17
аниÑĨ
0.16
ilar
0.15
plib
0.15
Strauss
0.15
FirstChild
0.14
atak
0.14
PACE
0.14
pong
0.14
Activations Density 0.021%