INDEX
    Explanations

    elements related to research methodology and analysis

    New Auto-Interp
    Negative Logits
    ########.
    -0.44
    Diwedd
    -0.44
    riwal
    -0.39
     wikipagina
    -0.39
    pholes
    -0.39
     twij
    -0.39
     okuyayım
    -0.38
    yttö
    -0.37
     goederen
    -0.37
     незавершена
    -0.37
    POSITIVE LOGITS
     theme
    3.66
     themes
    3.33
     Theme
    3.05
     thème
    2.94
    theme
    2.89
     THEME
    2.86
    Theme
    2.81
     Themes
    2.75
     tema
    2.67
    主题
    2.58
    Act Density 0.498%

    No Known Activations