INDEX
Explanations
phrases that involve possessive and definitive pronouns
New Auto-Interp
Negative Logits
baro
-0.57
Wes
-0.54
Benth
-0.52
dấu
-0.52
nó
-0.51
Wes
-0.50
Kraj
-0.50
ombat
-0.50
valho
-0.50
IZABETH
-0.49
POSITIVE LOGITS
gotten
1.11
gets
1.08
Gets
1.07
Gets
1.04
GETS
0.98
got
0.92
get
0.92
gets
0.87
Få
0.84
Get
0.82
Activations Density 0.068%