INDEX
Explanations
possessives and distinguishing entities
New Auto-Interp
Negative Logits
temperament
0.44
sculpture
0.39
region
0.38
atuan
0.37
0.36
own
0.35
Utopia
0.35
softening
0.35
colouring
0.35
вшей
0.35
POSITIVE LOGITS
votre
0.50
your
0.49
what
0.48
what
0.44
आपके
0.44
your
0.42
явля
0.41
ಅವರ
0.40
Ihrem
0.40
ہزار
0.40
Activations Density 0.000%