INDEX
Negative Logits
Assessment
0.78
inability
0.73
Assessment
0.67
缺乏
0.65
保持
0.63
interaction
0.61
incapable
0.61
निहित
0.61
内容
0.61
Axes
0.60
POSITIVE LOGITS
ulfate
0.80
sulfate
0.79
gathered
0.79
Dent
0.76
dentist
0.76
gleaned
0.75
meant
0.75
sempl
0.74
cuc
0.74
intended
0.74
Activations Density 0.059%