Explanations
significant or impactful moments and events
Negative Logits
Token       Logit
owa         -0.17
oria        -0.15
éĢĶ         -0.14
Minor       -0.14
ová         -0.14
orough      -0.14
ausal       -0.14
ãĥ¼ãĥŃ      -0.14
imo         -0.14
slightly    -0.14
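Two of the tokens listed above (`éĢĶ` and `ãĥ¼ãĥŃ`) look like raw byte-level BPE vocabulary strings rather than readable text. The sketch below shows one way to turn such strings back into text, assuming the underlying tokenizer uses the GPT-2 style byte-to-unicode mapping. The page does not name the model or tokenizer, so this is an assumption, and `decode_token` is a hypothetical helper name, not part of any dashboard API.

```python
def bytes_to_unicode():
    """Rebuild the GPT-2 style table mapping each byte (0-255) to a printable character."""
    # Bytes that are already printable map to themselves.
    bs = list(range(0x21, 0x7F)) + list(range(0xA1, 0xAD)) + list(range(0xAE, 0x100))
    cs = bs[:]
    n = 0
    # Every remaining byte is shifted to a code point at 256 and above.
    for b in range(256):
        if b not in bs:
            bs.append(b)
            cs.append(256 + n)
            n += 1
    return dict(zip(bs, (chr(c) for c in cs)))


# Inverse table: printable stand-in character -> original byte value.
UNICODE_TO_BYTE = {ch: b for b, ch in bytes_to_unicode().items()}


def decode_token(token: str) -> str:
    """Hypothetical helper: map a raw vocabulary string back to UTF-8 text.

    Assumes every character of `token` belongs to the byte-level alphabet.
    Bytes that do not form valid UTF-8 become the replacement character.
    """
    raw = bytes(UNICODE_TO_BYTE[ch] for ch in token)
    return raw.decode("utf-8", errors="replace")


if __name__ == "__main__":
    for tok in ["éĢĶ", "ãĥ¼ãĥŃ"]:
        print(repr(tok), "->", decode_token(tok))
```

Under that assumption, `éĢĶ` decodes to the single character U+9014 and `ãĥ¼ãĥŃ` to the two katakana U+30FC U+30ED. The helper should only be applied to tokens that are displayed in raw form: a token that the page already shows as readable text, such as `ová`, would be corrupted by a second decoding pass.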
Positive Logits
Token          Logit
deal           0.24
contrast       0.19
DEAL           0.19
Deal           0.17
Deal           0.17
part           0.17
departure      0.16
improvement    0.16
difference     0.16
portion        0.16
Activations Density: 0.082%