INDEX
Explanations
contextual elaboration after specific words
New Auto-Interp
Negative Logits
replete
0.44
hinch
0.43
웃
0.43
flashing
0.43
exertion
0.42
sacrific
0.41
Sousa
0.41
人体
0.41
มาก
0.41
mutagenesis
0.40
POSITIVE LOGITS
tors
0.53
frameNStart
0.50
xw
0.47
authorised
0.46
GST
0.46
Príncipe
0.44
國內
0.43
ಕಂಚ
0.43
Jockey
0.43
ゅう
0.42
Activations Density 0.003%