INDEX
Explanations
phrases prompting consideration or reflection
Negative Logits
"},"          -0.71
ially         -0.71
Cause         -0.69
][/           -0.67
Written       -0.66
ccess         -0.66
\"            -0.65
\",           -0.63
"}],"         -0.63
Requires      -0.62
Positive Logits
Exhibit       0.73
tainment      0.70
examples      0.70
Carly         0.68
Akron         0.68
Leban         0.68
example       0.67
Fukushima     0.67
Rwanda        0.67
Corinthians   0.67
Activations Density: 0.103%