Explanations
keywords referring to specific objects or entities within a broader context
references to specific entities or items denoted as "ones."
Negative Logits
Token       Logit
Cl          -0.69
Phys        -0.67
Dep         -0.64
inson       -0.63
osponsors   -0.62
Ec          -0.61
amen        -0.61
ITED        -0.61
Pwr         -0.61
Dem         -0.60
Positive Logits
Token       Logit
hots        0.87
omething    0.80
eyed        0.74
cott        0.73
Hundred     0.72
Thousand    0.70
elf         0.69
ones        0.69
chwitz      0.68
Obi         0.67
Activations Density: 0.034%