Explanations
mentions of specific names, likely related to social media accounts or individuals
proper nouns and names related to individuals or organizations
Negative Logits
Token    Logit
zos      -0.90
Rez      -0.89
Jonas    -0.86
Pok      -0.86
Zen      -0.83
zn       -0.83
zo       -0.82
ten      -0.82
Eisen    -0.81
Pie      -0.75
Positive Logits
Token    Logit
iler     0.87
av       0.85
taboola  0.84
ave      0.84
aver     0.83
BB       0.81
aving    0.80
CRIP     0.80
cle      0.78
iling    0.77
Activations Density: 0.589%