INDEX
Explanations
numerical ratings or scores
New Auto-Interp
Head Attr Weights
0:0.13
1:0.03
2:0.08
3:0.06
4:0.12
5:0.03
6:0.14
7:0.01
8:0.14
9:0.04
10:0.05
11:0.12
Negative Logits
stood
-1.50
grounds
-1.49
idelity
-1.47
counterpart
-1.46
ledged
-1.46
overt
-1.45
plat
-1.44
negotiating
-1.43
grasp
-1.37
contingent
-1.34
POSITIVE LOGITS
ophon
1.73
etc
1.56
ffiti
1.53
inea
1.50
��
1.44
................
1.42
Shepherd
1.41
illance
1.39
ESE
1.37
@@
1.37
Activations Density 0.005%