INDEX
    Explanations

    references to the concept of 'twins'

    references to twins or twin-related concepts

    New Auto-Interp
    Negative Logits
    anwhile
    -0.87
     CoC
    -0.84
    andise
    -0.76
    UME
    -0.71
    uddin
    -0.67
    ulkan
    -0.66
    ãģĵ
    -0.65
    pmwiki
    -0.65
    aining
    -0.64
    ktop
    -0.64
    POSITIVE LOGITS
    ned
    1.22
    ning
    1.18
    fold
    0.92
     towers
    0.88
     brother
    0.85
     peaks
    0.82
    xes
    0.81
     sister
    0.80
     Peaks
    0.78
     brothers
    0.77
    Act Density 0.031%

    No Known Activations