INDEX
Explanations
words associated with actions or events over time, particularly those involving time markers and significant occurrences
New Auto-Interp
Head Attr Weights
0:0.02
1:0.01
2:0.18
3:0.18
4:0.11
5:0.04
6:0.04
7:0.08
8:0.06
9:0.04
10:0.12
11:0.06
Negative Logits
ゴン
-1.66
endif
-1.55
ILA
-1.53
xual
-1.39
metics
-1.37
ophon
-1.35
��
-1.34
decoration
-1.33
IDA
-1.30
%.
-1.30
POSITIVE LOGITS
uddenly
1.82
?",
1.82
uberty
1.74
prematurely
1.46
overth
1.42
ocious
1.42
?),
1.41
THEN
1.41
suddenly
1.39
rocket
1.38
Activations Density 0.164%