INDEX
    Explanations

    phrases related to a specific controversial viewpoint on having children

    New Auto-Interp
    Negative Logits
     Gorb
    -0.70
     Rine
    -0.57
     Hecht
    -0.52
     kasa
    -0.52
     Knud
    -0.51
     Barbier
    -0.51
    $-$\\
    -0.50
     Hildebrand
    -0.50
    ↵↵↵↵↵↵↵↵↵↵↵↵↵↵↵↵↵↵↵↵↵↵↵↵↵↵
    -0.50
     Schenk
    -0.49
    POSITIVE LOGITS
     paradiso
    0.63
     further
    0.56
    RenderAtEndOf
    0.54
     compleanno
    0.52
     furt
    0.51
     FURTHER
    0.51
    leggings
    0.51
     bacio
    0.51
     divertimento
    0.49
     step
    0.49
    Act Density 0.165%

    No Known Activations