INDEX
Explanations
terms related to design and media, particularly in the context of fashion and literature
New Auto-Interp
Negative Logits
setVerticalGroup
-0.57
ſur
-0.57
виправивши
-0.56
};*/
-0.55
cauſe
-0.54
ſta
-0.51
uſed
-0.50
purpoſe
-0.50
ſtand
-0.49
IsContent
-0.49
POSITIVE LOGITS
junk
0.79
Junk
0.78
holic
0.75
Junk
0.74
junk
0.72
addict
0.70
erati
0.69
buff
0.69
fanatic
0.68
thusi
0.67
Activations Density 0.289%