Explanations
Mentions of language and its various forms
language, languages
Negative Logits
ücks        -0.71
TypeDef     -0.58
︎           -0.57
greateſt    -0.56
ſelves      -0.55
ſelf        -0.55
occafion    -0.55
reaſon      -0.54
preſent     -0.54
microm      -0.54
Positive Logits
Lang        0.89
LANG        0.87
Language    0.78
Languages   0.73
languages   0.73
Langu       0.73
Lang        0.71
язы         0.71
Languages   0.70
auge        0.70
Activations Density: 0.106%