INDEX
Explanations
clarifying purpose or nature
New Auto-Interp
Negative Logits
だけでなく
0.45
但也
0.43
although
0.40
scanty
0.39
zowel
0.39
同様
0.38
but
0.36
αλλά
0.35
ஆனால்
0.35
लेकिन
0.35
POSITIVE LOGITS
fundamentally
0.53
THEM
0.41
conceptually
0.41
YOU
0.40
скорее
0.39
COMMUNITY
0.38
FUND
0.38
COMPANIES
0.38
오히려
0.38
YOUR
0.37
Activations Density 0.268%