INDEX
Explanations
proper names
proper nouns, specifically names of people
New Auto-Interp
Negative Logits
âĶĢâĶĢ
-0.73
ANGEL
-0.72
Sof
-0.69
Pradesh
-0.67
Petra
-0.66
ãĥŁ
-0.65
Haram
-0.62
Heavenly
-0.62
Sasha
-0.62
Nusra
-0.60
POSITIVE LOGITS
enstein
1.13
nick
1.13
inger
1.10
enberg
1.09
quist
1.09
ansky
1.09
zinski
1.08
anson
1.06
inski
1.06
insky
1.05
Activations Density 0.247%