INDEX
Explanations
aerobic and anaerobic contexts
New Auto-Interp
Negative Logits
ק
0.78
betrieb
0.73
y
0.72
k
0.71
<0x0D>
0.70
criminal
0.69
ネ
0.69
क
0.68
нага
0.67
bord
0.66
POSITIVE LOGITS
aerobic
0.74
的部分
0.64
인을
0.64
ూర్
0.63
경우
0.63
말미암아
0.62
이지만
0.61
Gotta
0.59
facult
0.58
Chaucer
0.58
Activations Density 0.002%