INDEX
    Explanations
    New Auto-Interp
    Negative Logits
    Cyfeiriadau
    -0.92
     smile
    -0.81
     happiness
    -0.81
    happiness
    -0.78
    DockStyle
    -0.75
    Happiness
    -0.71
    MemoryWarning
    -0.71
     smiles
    -0.70
    tvguidetime
    -0.69
    AsUp
    -0.68
    POSITIVE LOGITS
     Beds
    1.07
     beds
    1.05
     Bed
    0.96
    beds
    0.95
     BED
    0.93
    Beds
    0.92
    Bed
    0.90
    BED
    0.83
     bed
    0.80
     headboard
    0.77
    Act Density 0.061%

    No Known Activations