INDEX
Explanations
terms related to actions or decisions (e.g., reversal, developments, finding, decision)
key events or actions related to decisions and developments
New Auto-Interp
Negative Logits
Native
-0.74
Tang
-0.64
Zip
-0.63
avis
-0.63
;;
-0.61
burning
-0.61
Scotland
-0.61
Choose
-0.60
Katrina
-0.60
ochet
-0.60
POSITIVE LOGITS
coincides
0.84
coincided
0.81
prompted
0.77
relates
0.76
underscores
0.75
embold
0.75
includes
0.73
consists
0.73
amounted
0.73
consisted
0.73
Activations Density 0.391%