Explanations
narratives of growth and change over time
Negative Logits
ogóle     -0.57
枉        -0.47
romi      -0.46
orteur    -0.44
segi      -0.44
otri      -0.43
ima       -0.43
henko     -0.43
totally   -0.43
låg       -0.43
Positive Logits
gradually       1.10
progressively   1.06
Gradually       1.06
increasingly    1.02
越来越          0.99
越來越          0.95
dần             0.94
increasing      0.94
increasing      0.94
ujednoznacz     0.92
Activations Density: 0.387%