INDEX
Explanations
phrases related to legal or political actions
phrases indicating accountability or consequences related to actions
New Auto-Interp
Negative Logits
ety
-0.55
¬¼
-0.55
herent
-0.52
iku
-0.49
rack
-0.49
inately
-0.48
sylv
-0.47
aza
-0.47
estones
-0.47
corrid
-0.47
POSITIVE LOGITS
albeit
0.88
namely
0.76
Kinnikuman
0.75
although
0.74
including
0.69
etc
0.68
which
0.65
aka
0.65
however
0.63
according
0.62
Activations Density 1.295%