INDEX
Explanations
phrases that indicate age or duration, particularly in relation to objects or systems
Negative Logits
ãĥŃãĥ¼    -0.17
άνι       -0.15
399       -0.14
Coy       -0.14
bs        -0.14
rud       -0.14
å²Ĺ       -0.13
gart      -0.13
tslib     -0.13
hooked    -0.13
Positive Logits
older     0.62
age       0.59
aged      0.54
oldest    0.53
Older     0.51
ages      0.49
old       0.47
-aged     0.46
older     0.46
Age       0.44
Activations Density 0.186%
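
The density figure above can be read as the share of tokens on which the feature fires. The sketch below illustrates that reading only; it assumes "fires" means an activation strictly above a threshold of 0, and the function name activation_density and the toy data are hypothetical, not taken from the tool that produced this page.

```python
from typing import Sequence


def activation_density(activations: Sequence[float], threshold: float = 0.0) -> float:
    """Return the fraction of tokens whose activation exceeds `threshold`.

    Assumption: a token counts as "active" when its activation is
    strictly greater than the threshold (0.0 by default).
    """
    if not activations:
        raise ValueError("need at least one activation value")
    fired = sum(1 for a in activations if a > threshold)
    return fired / len(activations)


# Toy data (hypothetical): 3 active tokens out of 1000.
toy = [0.0] * 997 + [0.8, 1.3, 0.2]
print(f"{activation_density(toy):.3%}")  # prints 0.300%
```

Under this reading, a density of 0.186% would correspond to roughly 2 active tokens per 1,000 sampled.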