I am working on a project with a large dataset from WebofScience and Scopus. Because of this, I have to make a very long alias list to account of slight variations in spelling, abbreviation, and capitalization of institutions. Is there any way to make the analysis case INsensitive, to reduce the work to be done on the alias list? It seems like countries are all case insensitive, but I am having trouble consolidating the institutions based on differences in what is capitalized.
Thank you!
Claire Sorenson
If you would like to refer to this comment somewhere else in this project, copy and paste the following link:
Thanks for your suggestion. I am making the following changes:
1. Add a new function to generate a template alias file, containing all the nodes in two columns separated by a tab. The file name is template_citespace.alias.
2. Visualize the network as usual, then generate the template from menu: Export > Nodes: Generate an Alias Template
3. Users can modify the template file as a plain text in Notepad++ or as a TSV file in Excel and save it as citespace.alias for CiteSpace to use.
4. Modify the template in two ways: a): Modify the first column, which is how it will appear, and/or 2) Remove rows you don't want to change, or simply keep them there. If you keep them without change, nothing will change, just some extra underlying processing.
5. The second column will be case insensitive.
How does this sound? I will include it in the next release.
Last edit: Chaomei Chen 2022-11-08
If you would like to refer to this comment somewhere else in this project, copy and paste the following link:
Hi Chaomei,
I am working on a project with a large dataset from WebofScience and Scopus. Because of this, I have to make a very long alias list to account of slight variations in spelling, abbreviation, and capitalization of institutions. Is there any way to make the analysis case INsensitive, to reduce the work to be done on the alias list? It seems like countries are all case insensitive, but I am having trouble consolidating the institutions based on differences in what is capitalized.
Thank you!
Claire Sorenson
Thanks for your suggestion. I am making the following changes:
1. Add a new function to generate a template alias file, containing all the nodes in two columns separated by a tab. The file name is template_citespace.alias.
2. Visualize the network as usual, then generate the template from menu: Export > Nodes: Generate an Alias Template
3. Users can modify the template file as a plain text in Notepad++ or as a TSV file in Excel and save it as citespace.alias for CiteSpace to use.
4. Modify the template in two ways: a): Modify the first column, which is how it will appear, and/or 2) Remove rows you don't want to change, or simply keep them there. If you keep them without change, nothing will change, just some extra underlying processing.
5. The second column will be case insensitive.
How does this sound? I will include it in the next release.
Last edit: Chaomei Chen 2022-11-08
That sounds like a great format! I love the alias function, and this will be very helpful to not having to make so many. Thank you!