INDEX
Explanations
linking verbs or identifying phrases
New Auto-Interp
Negative Logits
Its
0.88
которое
0.85
Its
0.76
którego
0.75
একটি
0.75
sebuah
0.67
itself
0.66
一个
0.66
kuris
0.65
яке
0.64
POSITIVE LOGITS
themselves
1.46
რომლებიც
1.20
considerados
1.09
也都
1.07
ඒවා
1.07
जिनमें
1.05
كلهم
1.05
mselves
1.04
आहेत
1.02
những
1.02
Activations Density 0.444%