INDEX
Explanations
phrases indicating the acquisition of knowledge or benefits
Negative Logits
Roberts      -0.66
Georg        -0.62
Stevenson    -0.62
precau       -0.62
Schröder     -0.61
Brown        -0.61
pory         -0.60
Corcoran     -0.60
Schroeder    -0.59
precios      -0.58
Positive Logits
GAIN         1.85
Gain         1.78
Gains        1.72
gain         1.71
gain         1.70
gains        1.67
Gain         1.65
gained       1.61
GAIN         1.50
gains        1.46
Activations Density 0.061%