INDEX
Negative Logits
ᇁ
0.70
/*
0.62
Ꮈ
0.62
ᄈ
0.59
r
0.58
tfidf
0.57
Ⴖ
0.56
looked
0.56
ᄊ
0.56
ﺌ
0.54
POSITIVE LOGITS
5
0.62
crushes
0.59
8
0.58
ński
0.57
6
0.55
conocido
0.53
7
0.53
2
0.52
haters
0.52
</th>
0.52
Activations Density 0.676%
ᇁ
/*
Ꮈ
ᄈ
r
tfidf
Ⴖ
looked
ᄊ
ﺌ
5
crushes
8
ński
6
conocido
7
2
haters
</th>