INDEX
Negative Logits
deeper
0.69
There
0.67
basics
0.65
देखना
0.61
हराकर
0.61
These
0.61
жерела
0.61
approachable
0.60
Those
0.60
Pure
0.59
POSITIVE LOGITS
]:
0.70
.}\
0.66
setminus
0.64
']:
0.64
]=
0.63
}:
0.63
ד
0.61
ถาม
0.61
ר
0.61
.}
0.60
Activations Density 0.085%