INDEX
Explanations
the core or heart of something
Negative Logits
é    0.65
j    0.64
í    0.61
الحكومة    0.60
deterred    0.58
finanz    0.57
சிலர்    0.56
společnost    0.56
ная    0.55
హ    0.55
Positive Logits
Core    0.74
Core    0.71
сердце    0.71
centerpiece    0.70
core    0.67
adjacency    0.62
centrale    0.62
bataille    0.61
سلاټ    0.61
Одно    0.60
Activations Density 0.270%