INDEX
Explanations
neologisms or terms that signify change or evolution, particularly regarding historical contexts
New Auto-Interp
Head Attr Weights
0:0.03
1:0.02
2:0.08
3:0.06
4:0.08
5:0.03
6:0.34
7:0.05
8:0.04
9:0.05
10:0.12
11:0.06
Negative Logits
Unloaded
-1.54
rored
-1.52
POR
-1.47
IUM
-1.40
66666666
-1.39
votes
-1.38
Available
-1.37
afety
-1.37
ongyang
-1.36
レ
-1.35
POSITIVE LOGITS
narratives
1.59
traditions
1.53
ivism
1.52
ivist
1.50
nostalg
1.49
Celebration
1.48
sensibilities
1.48
renaissance
1.46
histor
1.45
Romantic
1.44
Activations Density 0.001%