    Explanations

    different types of things

    Negative Logits
     सहज    0.40
     महत्वपूर्ण    0.38
    आय    0.38
     ненави    0.38
    아이    0.37
     важ    0.37
     அன்பு    0.36
    頑張    0.36
     গুরুত্বপূর্ণ    0.36
     মাইক্র    0.36
    Positive Logits
     vibe    0.63
     kettle    0.62
     kett    0.56
     kinda    0.53
     type    0.52
     kind    0.52
     style    0.51
     thing    0.46
     realms    0.46
     universes    0.46
    Activation Density: 0.007%

    No Known Activations