INDEX
Explanations
elements indicating stages of progress or commitment in processes or studies
Negative Logits
neté  -0.59
intervened  -0.54
humiliating  -0.52
severity  -0.52
poor  -0.51
usually  -0.50
ijnt  -0.50
unik  -0.50
parietal  -0.50
どうしても  -0.50
Positive Logits
forward  0.96
future  0.92
future  0.92
これからの  0.88
continuare  0.87
これから  0.86
avenir  0.84
continue  0.83
今後の  0.83
Future  0.82
Activations Density 0.192%