INDEX
Explanations
professionals or experts in various fields
references to experts or authority figures in various fields
New Auto-Interp
Negative Logits
etition
-0.72
nih
-0.69
tu
-0.67
rity
-0.66
umo
-0.64
imester
-0.64
flix
-0.64
iasm
-0.63
rez
-0.63
query
-0.63
POSITIVE LOGITS
extraord
0.96
proponent
0.90
alike
0.87
scourge
0.86
contributor
0.85
holder
0.84
supporter
0.84
founding
0.83
protector
0.83
collaborator
0.82
Activations Density 0.242%