INDEX
Explanations
geographical locations and demographic references
New Auto-Interp
Negative Logits
straw
-0.16
Casc
-0.15
Morgan
-0.15
zan
-0.14
Wings
-0.14
ãĤĬãģ¨
-0.14
Barb
-0.14
é̏
-0.14
Pun
-0.13
éĿ
-0.13
POSITIVE LOGITS
rein
0.30
Lap
0.26
Trom
0.20
lap
0.20
LAP
0.19
ieu
0.17
bear
0.17
Bear
0.17
Rein
0.16
áj
0.16
Activations Density 0.008%