INDEX
Explanations
text related to official inquiries or investigations
New Auto-Interp
Negative Logits
er
-0.81
ness
-0.70
ner
-0.64
eness
-0.61
Bü
-0.61
وسلم
-0.61
enna
-0.59
uxxxx
-0.57
na
-0.56
ena
-0.56
POSITIVE LOGITS
investigations
1.65
investigation
1.54
Investigate
1.49
Investigations
1.49
Investigations
1.45
INVESTIGATION
1.42
Investigation
1.39
investigate
1.38
investigation
1.38
investigated
1.37
Activations Density 0.091%