Explanations
proper nouns, specifically names of individuals or locations
Negative Logits
Token        Logit
âĶĢâĶĢ       -0.74
ANGEL        -0.70
ktop         -0.70
BOX          -0.69
VIDE         -0.66
daytime      -0.66
srfAttach    -0.62
volume       -0.61
Serie        -0.61
PROG         -0.60
Positive Logits
Token        Logit
baugh        1.31
enberg       1.25
hoff         1.17
hart         1.14
ley          1.12
love         1.10
ingham       1.09
gren         1.08
berger       1.07
meyer        1.06
Activations Density: 0.231%