INDEX
Explanations
terms and phrases associated with critical assessment or evaluation
Negative Logits
thalene      -0.81
Pog          -0.76
adə          -0.73
Pog          -0.68
FlowLayout   -0.65
illoma       -0.63
Controllo    -0.62
wars         -0.61
liflower     -0.61
للاسماء      -0.60
Positive Logits
kritik       1.01
Cri          0.99
Crit         0.95
Kri          0.91
Cristóbal    0.90
rítica       0.89
CRI          0.85
Crítica      0.82
kras         0.81
Kritik       0.81
Activations Density: 0.121%