Explanations
phrases indicating progress or advancement towards a goal
references to reaching a critical point of achievement or realization
Negative Logits
oath          -0.67
Broad         -0.65
fared         -0.62
Synopsis      -0.61
sson          -0.60
messenger     -0.59
Evening       -0.59
---------     -0.58
Brav          -0.58
screenings    -0.57
Positive Logits
viability      0.90
equilibrium    0.89
acceptable     0.84
feasible       0.81
irreversible   0.80
acceptable     0.80
manageable     0.80
uble           0.80
saturation     0.78
maximal        0.78
Activations Density: 0.332%