INDEX
Explanations
proper nouns
references to specific individuals, particularly those with the last name "Klein."
Negative Logits
iate        -0.67
ially       -0.67
iment       -0.67
iments      -0.67
Inqu        -0.66
iations     -0.63
icts        -0.60
intervals   -0.59
facilit     -0.59
Relief      -0.58
Positive Logits
pin         0.91
bard        0.82
toe         0.80
perm        0.79
tag         0.76
baum        0.75
amas        0.74
jas         0.71
Lerner      0.70
ashtra      0.69
Activations Density 0.152%