INDEX
Explanations
temporal references and durations
New Auto-Interp
Negative Logits
gyhoeddwyd
-0.63
يتيمه
-0.62
inev
-0.61
entourage
-0.57
hastened
-0.57
Aktualisiert
-0.57
opro
-0.56
pokémon
-0.55
Majefty
-0.55
ftagPool
-0.54
POSITIVE LOGITS
hindurch
0.78
hinweg
0.76
endwhile
0.70
esternos
0.69
boyunca
0.64
remain
0.58
kept
0.55
always
0.55
一直
0.54
までは
0.54
Activations Density 0.364%