INDEX
Explanations
names of people or locations
proper nouns, specifically names of individuals and locations
New Auto-Interp
Negative Logits
Downloadha
-0.75
Frozen
-0.68
ndra
-0.67
warm
-0.64
favour
-0.63
tics
-0.62
adesh
-0.62
Guilty
-0.61
polar
-0.59
daily
-0.58
POSITIVE LOGITS
ONSORED
0.83
Department
0.67
ENSE
0.66
itzer
0.65
utenberg
0.65
ourke
0.63
ãĤ¼ãĤ¦ãĤ¹
0.63
agall
0.59
Dept
0.59
nel
0.58
Activations Density 0.309%