INDEX
Explanations
negation and conceptual description
New Auto-Interp
Negative Logits
লা
0.46
हमारे
0.44
महिला
0.43
প্রতিদিন
0.43
调节
0.43
पाणी
0.43
ล
0.42
>");
0.41
আগুন
0.41
不上
0.41
POSITIVE LOGITS
hypothesized
0.54
conceptually
0.54
syntax
0.53
dictionaries
0.50
notation
0.49
descriptive
0.48
describing
0.48
multilingual
0.48
languages
0.47
conceptual
0.47
Activations Density 0.006%