INDEX
Explanations
proper nouns referring to individuals, likely names
references to specific individuals, particularly "Gret" and related names
New Auto-Interp
Negative Logits
shirt
-0.80
xual
-0.78
termin
-0.72
shirts
-0.70
panel
-0.69
checkout
-0.67
zees
-0.64
REDACTED
-0.62
Pigs
-0.62
Jinping
-0.61
POSITIVE LOGITS
alus
0.93
sburg
0.89
olf
0.84
heim
0.80
inka
0.79
ür
0.79
ald
0.78
ersen
0.75
alf
0.75
ij
0.74
Activations Density 0.030%