INDEX
Explanations
phrases indicating possession or association
references to individuals or entities characterized by the word "whose."
New Auto-Interp
Negative Logits
ptions
-0.62
PLA
-0.60
rums
-0.58
aze
-0.54
eco
-0.53
Alright
-0.51
ming
-0.51
info
-0.51
Things
-0.50
mo
-0.50
POSITIVE LOGITS
whose
3.15
whose
2.68
whom
2.31
who
1.74
who
1.61
whence
1.36
which
1.32
wherein
1.17
WHO
1.03
which
1.03
Activations Density 0.021%