I want to remove each of the following special characters from my documents:
symbols = {`,~,!,@,#,$,%,^,&,*,(,),_,-,+,=,{,[,],},|,\,:,;,",<,,,>,.,?,/}
The reason why I am not simply doing something like this:
document = re.sub(r'([^\s\w]|_)+', '', document)
is that in this way I remove also many (accented/special) letters in the case of documents written in languages such as Polish etc.
How can I remove each of the special characters above in one expression?