INDEX
Explanations
questions or statements indicating a need for action or resolution
phrases that indicate necessity or requirement for action
New Auto-Interp
Negative Logits
gdala
-0.72
Zip
-0.65
Democr
-0.64
Rost
-0.62
izon
-0.62
Mamm
-0.61
Guilty
-0.59
Varg
-0.57
theless
-0.56
ulty
-0.56
POSITIVE LOGITS
attention
0.94
lessly
0.91
scrutiny
0.82
FINE
0.77
tweaking
0.77
ENTION
0.75
updating
0.75
Citation
0.73
correction
0.72
refinement
0.72
Activations Density 0.078%