INDEX
Explanations
bracketed text or items within parentheses
New Auto-Interp
Head Attr Weights
0:0.04
1:0.02
2:0.14
3:0.10
4:0.05
5:0.07
6:0.23
7:0.03
8:0.09
9:0.06
10:0.05
11:0.08
Negative Logits
natureconservancy
-1.72
Rush
-1.68
bryce
-1.64
�
-1.56
upt
-1.55
patch
-1.54
aea
-1.54
Scar
-1.45
��
-1.42
ky
-1.41
POSITIVE LOGITS
meanwhile
1.55
enhagen
1.49
compet
1.42
horizont
1.42
Monaco
1.41
Petersen
1.35
puter
1.34
Wilkinson
1.32
PLIED
1.32
roleum
1.28
Activations Density 0.002%