INDEX
Explanations
references to self-driving or autonomous technology
mentions of self-driving technology
Negative Logits
GOODMAN     -0.76
Syndicate   -0.75
Ashe        -0.74
Orchestra   -0.74
XIII        -0.73
ropolitan   -0.69
Flavoring   -0.69
————————    -0.67
ĸļ          -0.67
ī           -0.66
Positive Logits
destruct      1.26
destruct      1.09
upload        0.86
explanatory   0.84
lect          0.81
same          0.77
uls           0.77
conscious     0.77
ortium        0.76
uded          0.75
Activations Density 0.015%