INDEX
Explanations
phrases indicating prevention or avoidance of negative outcomes
New Auto-Interp
Head Attr Weights
0:0.02
1:0.01
2:0.07
3:0.08
4:0.28
5:0.02
6:0.03
7:0.25
8:0.03
9:0.03
10:0.05
11:0.07
Negative Logits
AppData
-1.80
cloth
-1.73
onyms
-1.57
database
-1.56
ebook
-1.53
Yaz
-1.44
Kardash
-1.43
Palest
-1.43
Qur
-1.41
Quran
-1.40
POSITIVE LOGITS
hostilities
1.95
deterioration
1.88
overload
1.87
inevitable
1.87
failure
1.85
impending
1.83
revolt
1.82
erosion
1.78
disruption
1.77
backlash
1.71
Activations Density 0.000%