INDEX
Explanations
references to assistance or support in various contexts
Negative Logits
nette                   -0.56
crom                    -0.55
<eos>                   -0.53
Rech                    -0.53
fels                    -0.52
Filmografie             -0.52
es                      -0.50
People                  -0.50
…………………………………………  -0.50
Lübeck                  -0.50
Positive Logits
aid                     4.27
Aid                     4.06
Aid                     3.56
aid                     3.29
AID                     3.08
aids                    2.99
AID                     2.54
Aids                    2.52
aiding                  2.40
aide                    2.36
Activations Density: 0.066%