INDEX
Explanations
names of sports players
various forms of parentheses
New Auto-Interp
Negative Logits
contempor
-0.81
tabloid
-0.77
genre
-0.76
nature
-0.75
propag
-0.74
breed
-0.74
ram
-0.73
variations
-0.73
perme
-0.72
refreshing
-0.72
POSITIVE LOGITS
seven
1.53
nine
1.52
fifth
1.51
six
1.51
fourth
1.49
eight
1.49
four
1.48
three
1.46
third
1.46
five
1.44
Activations Density 0.111%