INDEX
Explanations
quantitative values or measurements
patterns of evaluation and contrast in narratives
New Auto-Interp
Negative Logits
actionDate
-0.66
Babel
-0.62
Advice
-0.61
ãĤ©
-0.60
Photographer
-0.60
Canary
-0.60
ãĥĺ
-0.59
miscon
-0.58
Workers
-0.57
Lag
-0.57
POSITIVE LOGITS
nevertheless
0.93
nonetheless
0.83
poons
0.76
thrive
0.75
este
0.74
lean
0.72
ISTER
0.72
eston
0.68
perfectly
0.65
thriving
0.64
Activations Density 0.816%