INDEX
    Explanations
    New Auto-Interp
    Negative Logits
     stroller
    0.83
    etam
    0.81
     বাঙালী
    0.80
     repeatability
    0.80
     donkey
    0.79
     rake
    0.73
     strollers
    0.73
    ﺿ
    0.72
     scar
    0.72
     cordless
    0.71
    POSITIVE LOGITS
    teachers
    0.98
    hard
    0.96
     menengah
    0.85
    teacher
    0.78
    🏫
    0.76
     Teachers
    0.75
     thought
    0.75
     Hard
    0.75
    Teachers
    0.74
    درسة
    0.74
    Act Density 0.003%

    No Known Activations