INDEX
Explanations
proper nouns, particularly names associated with legal cases
Head Attr Weights
0:0.14
1:0.07
2:0.06
3:0.07
4:0.06
5:0.04
6:0.08
7:0.10
8:0.03
9:0.06
10:0.15
11:0.08
Negative Logits
Rudy: -3.33
Cisco: -3.06
Mirage: -2.91
Natasha: -2.82
Ghostbusters: -2.81
Omni: -2.77
UD: -2.75
Barkley: -2.66
Deadpool: -2.63
dispatcher: -2.62
Positive Logits
wh: 4.27
whale: 3.44
whales: 3.32
Whale: 3.29
Antarctic: 3.23
サ: 3.17
�: 3.13
Japan: 3.00
Fisheries: 2.98
�: 2.93
Activations Density 0.000%