INDEX
Explanations
terms related to scientific terms and research methodologies
New Auto-Interp
Negative Logits
.
-0.80
".
-0.73
'.
-0.72
”.
-0.70
.”
-0.63
’.
-0.63
").
-0.59
."
-0.59
').
-0.57
":
-0.57
POSITIVE LOGITS
principalColumn
0.69
ועוד
0.68
³,
0.67
nahilalakip
0.65
Билгалдахарш
0.62
etc
0.62
.$,
0.55
等等
0.54
PostConstruct
0.54
²,
0.53
Activations Density 0.990%