INDEX
Explanations
proper nouns, particularly related to locations and people's names
proper nouns and specific locations or entities
New Auto-Interp
Negative Logits
rac
-0.84
opian
-0.74
bishop
-0.74
iris
-0.73
diving
-0.72
camp
-0.72
zag
-0.72
college
-0.71
amic
-0.71
bladder
-0.71
POSITIVE LOGITS
Less
1.26
Given
1.23
Enough
1.23
Remove
1.23
Where
1.22
Needs
1.21
Absent
1.21
Only
1.21
Not
1.20
Put
1.18
Activations Density 0.368%