INDEX
Explanations
words related to personal pronouns
Negative Logits
ſtate               -0.58
DataLoader          -0.56
uſe                 -0.54
vább                -0.53
DataLoader          -0.53
uſed                -0.51
(token not shown)   -0.50
purpoſe             -0.50
}[!                 -0.49
poveznice           -0.49
Positive Logits
his        1.20
His        1.07
His        1.06
own        0.96
his        0.94
kanyang    0.93
their      0.91
Their      0.91
HIS        0.90
seiner     0.90
Activations Density 0.512%