INDEX
Explanations
references to actions and the necessity of taking action, particularly in relation to governmental or significant global issues
New Auto-Interp
Negative Logits
InputDecoration
-0.40
ViewFeatures
-0.39
joaat
-0.38
cakes
-0.37
disambiguazione
-0.37
ridas
-0.36
queles
-0.36
slidesToShow
-0.35
universities
-0.35
university
-0.34
POSITIVE LOGITS
action
0.93
actions
0.77
ACTION
0.77
acción
0.77
action
0.76
Action
0.72
行動
0.72
actie
0.71
Actions
0.69
ação
0.69
Activations Density 0.053%