INDEX
Explanations
social media handles and links
New Auto-Interp
Negative Logits
abestanden
-0.97
amaño
-0.84
^(@)
-0.78
leſs
-0.78
UnsafeEnabled
-0.76
—
-0.76
―――――
-0.76
ſind
-0.74
VERTISEMENT
-0.74
iſt
-0.74
POSITIVE LOGITS
@
0.82
@
0.64
verdens
0.57
'@
0.56
(@
0.56
("@0.54
('@0.52
@@@
0.52
kier
0.52
token
0.52
Activations Density 0.195%