INDEX
Explanations
comparisons between different items
New Auto-Interp
Negative Logits
தடை
0.46
отсут
0.45
ebabkan
0.43
超时
0.42
其实
0.42
atthena
0.41
iasco
0.40
传统
0.40
Weltkrieg
0.40
ක්ර
0.39
POSITIVE LOGITS
identical
0.68
different
0.66
разных
0.58
identical
0.57
identically
0.56
diferentes
0.55
unterschied
0.55
individuales
0.55
similares
0.55
разные
0.55
Activations Density 0.130%