INDEX
Explanations
mentions of tattoos
words related to tattoos
Negative Logits
leep      -0.74
士        -0.73
EH        -0.70
theless   -0.67
IDES      -0.67
audi      -0.66
ppe       -0.66
ĨĴ        -0.65
eger      -0.65
ESSION    -0.64
Positive Logits
tattoos    0.97
tattoo     0.91
Tatt       0.88
pigment    0.87
artist     0.86
tatt       0.82
ink        0.81
aesthetic  0.80
scars      0.80
eful       0.78
Activations Density: 0.038%