Explanations
instances of exposure or revealing something, particularly in technical contexts
Negative Logits
Benzema    -0.30
Hotspur    -0.30
andalf     -0.30
itrile     -0.28
dépos      -0.28
tiens      -0.28
établie    -0.27
InBytes    -0.27
zonych     -0.26
杯         -0.26
Positive Logits
exposed     0.83
Хьажоргаш   0.80
exposed     0.80
Exposed     0.76
exposing    0.75
Exposed     0.74
露出        0.73
uncovered   0.71
bare        0.70
expose      0.69
Activations Density: 0.554%