INDEX
Explanations
know that / understand that / remember that
New Auto-Interp
Negative Logits
ہو۔
0.57
گئی۔
0.48
ہوں۔
0.47
Он
0.47
Ма
0.46
جائے۔
0.46
വേദിക
0.45
کریں۔
0.44
കൂട
0.44
گی۔
0.43
POSITIVE LOGITS
,
0.76
while
0.73
,"
0.68
,]
0.65
,”
0.65
since
0.64
though
0.64
,’
0.64
unequivocally
0.64
nobody
0.63
Activations Density 0.124%