INDEX
Explanations
analogy comparison metaphor simile
Negative Logits
iesa    0.40
")),    0.39
ंबा    0.38
Authentication    0.35
जानते    0.35
সম্মিলিত    0.35
ineri    0.35
உள்ப    0.35
अड    0.34
!("{:    0.34
Positive Logits
analogy    2.63
analogies    2.55
metaphor    2.22
comparison    2.19
comparisons    2.14
metaphors    2.14
срав    2.06
likened    2.00
porówn    2.00
simile    1.98
Activations Density 0.037%