INDEX
Explanations
mentions of intentional or unintentional actions
terms related to intentionality in actions or events
New Auto-Interp
Negative Logits
Tens
-0.80
ularity
-0.77
stocks
-0.77
ings
-0.73
ciating
-0.73
heit
-0.72
rooms
-0.72
eting
-0.71
INGS
-0.71
uster
-0.70
POSITIVE LOGITS
unintentional
1.27
intentional
1.19
unintended
0.92
accidental
0.82
inadvert
0.82
idental
0.81
aneous
0.80
heon
0.77
intention
0.74
unintentionally
0.74
Activations Density 0.014%