INDEX
Explanations
mentions of the word "world" or related concepts indicating a global perspective
New Auto-Interp
Head Attr Weights
0:0.02
1:0.01
2:0.18
3:0.04
4:0.05
5:0.03
6:0.05
7:0.06
8:0.10
9:0.03
10:0.19
11:0.18
Negative Logits
ocious
-1.76
cure
-1.60
ixir
-1.57
ouf
-1.54
Cure
-1.52
Transformation
-1.48
ビ
-1.46
miracle
-1.45
resurrection
-1.44
olution
-1.42
POSITIVE LOGITS
sites
1.77
Apps
1.69
abases
1.68
includ
1.67
folders
1.58
endars
1.57
apps
1.57
uthor
1.57
idates
1.57
favorites
1.55
Activations Density 0.013%