INDEX
Explanations
names of people, particularly "Blanc" and "Malkin"
proper nouns, particularly names of people and places
New Auto-Interp
Negative Logits
#$
-0.74
WARE
-0.70
ngth
-0.70
ILCS
-0.70
Defenders
-0.70
Slot
-0.68
FFER
-0.65
CLAIM
-0.64
©¶æ¥µ
-0.62
ictionary
-0.62
POSITIVE LOGITS
sis
0.85
enary
0.82
inx
0.81
Blanc
0.77
eas
0.75
gio
0.75
inos
0.73
ère
0.72
uci
0.71
het
0.70
Activations Density 0.011%