INDEX
Explanations
prepositions and pronouns followed by certain nouns or phrases
phrases indicating possession or association
New Auto-Interp
Negative Logits
Champ
-0.80
ãĤ¼ãĤ¦ãĤ¹
-0.74
TEXT
-0.70
govtrack
-0.69
pedia
-0.65
à¦
-0.63
Sport
-0.63
crore
-0.61
cham
-0.60
IQ
-0.60
POSITIVE LOGITS
course
0.86
relations
0.74
sorts
0.70
wanting
0.70
the
0.69
elia
0.68
his
0.68
aspiring
0.68
those
0.67
raltar
0.64
Activations Density 0.056%