INDEX
Explanations
references to relationships or connections between entities
New Auto-Interp
Negative Logits
Third
-0.60
phú
-0.54
i
-0.53
-
-0.52
avanti
-0.51
e
-0.51
摘
-0.51
something
-0.50
다
-0.49
o
-0.48
POSITIVE LOGITS
whose
2.33
whose
2.26
Whose
2.22
Whose
2.14
whofe
2.12
whoſe
2.07
cuyas
1.82
cuya
1.76
cuyos
1.76
cuyo
1.72
Activations Density 0.029%