INDEX
Explanations
negative states and judgments
New Auto-Interp
Negative Logits
dichloromethane
0.72
resistant
0.72
encounters
0.70
Geometric
0.69
reverber
0.68
存在する
0.66
unreported
0.66
anthe
0.66
落地
0.66
obscured
0.65
POSITIVE LOGITS
selfish
1.51
selfishness
1.46
crazy
1.42
foolish
1.41
silly
1.35
ridiculous
1.33
coward
1.31
madness
1.30
craziness
1.28
stupid
1.23
Activations Density 0.259%