INDEX
Explanations
references to clothing, especially t-shirts with slogans or symbols
Negative Logits
Token       Logit
$__         -0.15
usize       -0.14
ภ           -0.14
WARRANTY    -0.14
::*         -0.14
rick        -0.14
reon        -0.14
착          -0.13
reck        -0.13
erland      -0.13
Positive Logits
Token         Logit
reads         0.24
words         0.23
saying        0.21
Reads         0.20
text          0.20
inscription   0.20
wording       0.20
slogan        0.20
message       0.19
words         0.19
Activations Density 0.224%
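
For readers unfamiliar with the density figure, the following is a minimal sketch of how such a number could be computed. It assumes that density means the fraction of token positions on which the feature's activation is above zero; the page itself does not define the term. The function name `activation_density`, the `threshold` parameter, and the sample data are all hypothetical and serve only as illustration.

```python
def activation_density(activations, threshold=0.0):
    """Return the fraction of token positions whose activation exceeds threshold.

    Assumption: "density" is the share of positions where the feature fires.
    """
    if not activations:
        raise ValueError("activations must be non-empty")
    active = sum(1 for a in activations if a > threshold)
    return active / len(activations)


# Hypothetical data: 1,000 token positions, 2 of which fire.
sample = [0.0] * 998 + [1.7, 0.4]
density = activation_density(sample)
print(f"{density:.3%}")
```

Under this reading, a density of 0.224% would correspond to the feature firing on roughly 2 of every 1,000 tokens sampled.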