INDEX
Explanations
geographical names and locations
New Auto-Interp
Negative Logits
ZF
-0.17
akis
-0.17
roti
-0.15
ди
-0.15
-runtime
-0.15
пÑĥ
-0.14
Tulsa
-0.14
kara
-0.14
ariat
-0.14
iou
-0.14
POSITIVE LOGITS
Madison
0.26
Dane
0.24
Wis
0.20
Bad
0.20
Mad
0.19
Wis
0.18
mad
0.18
Marathon
0.17
Bad
0.17
Wisconsin
0.17
Activations Density 0.025%