INDEX
Explanations
proper nouns
proper nouns, specifically names of people or entities
Negative Logits
enegger    -0.62
SPONSORED  -0.60
helicop    -0.57
destro     -0.57
compe      -0.57
surv       -0.56
sugg       -0.56
disg       -0.55
agre       -0.55
nodd       -0.54
Positive Logits
ū          0.67
Profile    0.65
omon       0.57
Shin       0.56
Brewing    0.56
erville    0.53
ā          0.52
asma       0.49
anta       0.48
aj         0.48
Activations Density 0.383%