INDEX
Explanations
references to people's names
occurrences of possessive pronouns related to people or entities
New Auto-Interp
Negative Logits
Zi
-0.68
Izan
-0.66
Kou
-0.65
ylon
-0.65
Iranians
-0.63
Europeans
-0.62
Cel
-0.61
Babe
-0.61
otin
-0.60
Lerner
-0.60
POSITIVE LOGITS
own
1.05
tremend
0.86
briefs
0.76
Own
0.75
yssey
0.75
introductory
0.74
travels
0.73
Ĥİ
0.70
stride
0.69
panic
0.68
Activations Density 0.132%