INDEX
Explanations
words related to specific names and locations
references to specific names and terms related to people and places
New Auto-Interp
Negative Logits
willingness
-0.65
shelf
-0.63
shelves
-0.63
HOME
-0.62
sky
-0.61
point
-0.60
prints
-0.60
slick
-0.60
rolling
-0.59
beacon
-0.59
POSITIVE LOGITS
riv
1.36
ilege
1.09
icz
0.97
loo
0.86
ulously
0.85
abulary
0.85
atical
0.84
itives
0.83
anium
0.83
PLIC
0.83
Activations Density 0.007%