INDEX
Explanations
references to a specific individual's life events and achievements
New Auto-Interp
Negative Logits
itect
-0.74
daq
-0.74
needed
-0.71
ibaba
-0.71
hari
-0.71
ocument
-0.71
Trend
-0.70
inctions
-0.69
anguage
-0.69
女
-0.69
POSITIVE LOGITS
own
1.69
wife
1.37
father
1.28
panic
1.28
predecessor
1.27
daughter
1.26
successor
1.26
Majesty
1.23
nephew
1.23
niece
1.22
Activations Density 1.172%