INDEX
Explanations
complex sentence structures and punctuation
Head Attr Weights
0:0.05
1:0.02
2:0.06
3:0.04
4:0.06
5:0.04
6:0.20
7:0.05
8:0.09
9:0.27
10:0.03
11:0.04
Negative Logits
Cabrera  -4.37
eva      -3.97
[*       -3.80
BALL     -3.62
Bezos    -3.58
Chest    -3.40
████     -3.35
Hera     -3.30
Flask    -3.23
Venus    -3.19
Positive Logits
Northern  8.84
Northern  7.95
orthern   5.97
NI        5.87
Southern  5.45
Southern  5.44
outhern   5.42
Ulster    5.36
Belfast   4.95
northern  4.88
Activations Density 0.009%