INDEX
Explanations
phrases related to physical structures
references to different types of structures
Negative Logits
Token       Logit
atz         -0.68
bat         -0.65
sung        -0.64
lethal      -0.64
Miss        -0.64
Medals      -0.63
bert        -0.63
deals       -0.63
hops        -0.63
Detective   -0.62
Positive Logits
Token       Logit
structure   3.64
Structure   2.85
structures  2.65
ructure     1.78
stru        1.71
Struct      1.70
structured  1.50
structural  1.44
mechanism   1.43
struct      1.39
Activations Density: 0.022%