INDEX
Explanations
holds abstract concepts and relationships
New Auto-Interp
Negative Logits
weren
1.11
Were
1.04
were
1.03
rieron
1.03
were
1.02
Were
0.94
დნენ
0.91
voltak
0.89
WERE
0.88
fossero
0.87
POSITIVE LOGITS
tiene
2.39
позволяет
2.17
دارد
2.14
делает
2.13
дает
2.13
имеет
2.11
обеспечивает
2.09
представляет
2.08
έχει
2.03
contiene
2.01
Activations Density 0.162%