INDEX
Explanations
instances where entities make their first notable appearance or debut
repeated mentions of possessive pronouns
New Auto-Interp
Negative Logits
hov
-0.82
earchers
-0.64
[];
-0.61
DN
-0.60
ibaba
-0.59
Guy
-0.58
Lago
-0.57
wr
-0.56
horizont
-0.56
Gi
-0.56
POSITIVE LOGITS
own
1.18
debut
0.88
stride
0.77
impression
0.77
selves
0.76
self
0.74
footing
0.73
customary
0.72
çİĭ
0.72
displeasure
0.70
Activations Density 0.043%