INDEX
Explanations
phrases related to investigations and dishonesty
instances of the character "ľ" in the text
Negative Logits
raints       -0.76
matic        -0.75
condem       -0.75
Instr        -0.73
ropes        -0.72
rouse        -0.70
ulators      -0.70
grips        -0.69
organisers   -0.68
tyres        -0.68
Positive Logits
ï¸ı                         1.18
âĶĢâĶĢ                      1.17
âĶĢâĶĢâĶĢâĶĢâĶĢâĶĢâĶĢâĶĢ    0.93
conom                       0.90
×Ķ                          0.85
Ł                           0.85
ĸ                           0.83
Ĩ                           0.83
ł                           0.83
¸                           0.82
Activations Density: 0.158%