Menu

Issue with Largest cc

2019-05-06
2019-05-18
  • Stephan De Spiegeleire

    We are trying to figure out all of the options of CiteSpace (using
    references as nodes) on a small dataset we created with two citing articles
    that each contain the same 10 cited references. We have set all parameters
    as 'expansive' as possible: top 100% of cited articles, k=100, no
    thresholds for c, cc and ccv for all 3 time slices, one time slice for all
    years. So we would expect a visualization containing 10 nodes with 45
    edges. But we only get 8 nodes visualized with (of course) 28 edges. Could
    you please explain this? Where does it get that 'Largest CC:8 (80%)'? You
    would save us (and maybe others) a lot of time if you could quickly explain
    this. Thanks much!

    Kindest regards,

    -Stephan

     
  • Chaomei Chen

    Chaomei Chen - 2019-05-07

    What are the values of Network: N=? and E=? in the signature of the upper left corner? If it says 8 accounts for 80%, that means the network does have 10 nodes in total. The 20% that is not shown because the default setting shows the LCC initially only. To view all the nodes, menu Filters > uncheck Show the Largest Connected Component Only.
    To have an all-inclusive and all-visible network, check the following settings:
    Project > Edit Properties:
    Link Retaining Factor: -1
    Nodes (e): 0
    Look Back Years: -1
    Selection Criteria:
    Top N%=100
    The maximum number of selected items per slice: make this large enough to cover, e.g. the total number of records
    Filter:
    Show the Largest Connected Component Only: Off
    Let me know how it goes.
    Here is an example when the Show Largest Connected Component Only is turned off.
    Here is an example when the Show Largest Connected Component Only is turned off.

     

    Last edit: Chaomei Chen 2019-05-07
  • Stephan De Spiegeleire

    Thanks for getting back to us. But unfortunately, that still doesn't seem to do the trick. Here are the settings we used:



    But as you can (hopefully) tell, CS still only visualizes 8 nodes. BTW, if you want to replicate this, here are the files we used.

     

    Last edit: Stephan De Spiegeleire 2019-05-07
  • Chaomei Chen

    Chaomei Chen - 2019-05-08

    Yes, there is indeed another reason - I forgot about it myself until I checked the code. There is a filter that caps the number of links a node may have. In the current version, it is set at no more than 75% of the total number of nodes, which explains why 8 nodes get connected to each other. The two isolated nodes are there. Move the node size slider with a black background, you should be able to see all 10 nodes but less than 45 links for the above reason. In a larger and more realistic dataset, link weights would be typically different enough so that stronger links are retained. The end users currently cannot change this setting. I will add a user control to the Edit Properties in the next version.
    Here is what it looks like if the filter is turned off:
    Maximum link per node

     

    Last edit: Chaomei Chen 2019-05-08
  • Stephan De Spiegeleire

    Thanks much! And yes - that one looks perfect. We look forward to being able to generate similar vizzes in a next version - also for much bigger and more realistic datasets.
    We do realize that this is not a 'realistic' dataset - but we just kept it artificially small to be able to fully understand how CiteSpace works. And we just didn't understand why those 8 (and not the other two) were selected AND/OR why not all edges were displayed. But so now we do understand and look forward to being able to also change that setting.
    And just to be clear - our main intent is to really map the entire 'knowledge landscape' for the topics we're interested in. We'd rather wait longer to see everything, than being more computationally efficient at the expense of missing potentially important nodes. And so this limited example suggested to us that that restriction (capping the number of links a node may have to no more than 75% of the total number of nodes) was just too random: why those 8 and not those two? But I do get your point that in more 'realistic' datasets, this choice to just retain the strongest links MAY make sense. And many (maybe even most) users of CiteSpace may be happy with that restriction. But we'd much rather have the option to look at the results both ways - both WITH that restriction and without.
    But please rest assured that we remain great fans of the program AND of your efforts to give us as users the greatest freedom possible to adjust all parameters! So thanks much for clearing this up and for giving us this additional option in the next version..
    [Oh and BTW one more comment: I myself am still having problems getting the graph visualization in the center of that window. I'm not sure whether this is due to the fact that I have an ultrawide monitor or not. But all I can say is that, for instance, for that viz I copied in the previous message, I had to look all over the place to 'find it'. It really would be great to have a button to put the entire graph viz in the center of the window]

     
    👍
    1

    Last edit: Stephan De Spiegeleire 2019-05-08
  • Chaomei Chen

    Chaomei Chen - 2019-05-08

    I am uploading the new version 5.4.R3 with a quick fix for now. I will add that button soon...

     
  • Stephan De Spiegeleire

    Wow :) ! Thanks!

     
  • Stephan De Spiegeleire

    Trying out the new version. The network modelling is taking a LOT longer than before...

     
  • Stephan De Spiegeleire

    Correction: it DOES work now. So thanks so much, Chaomei!

     

    Last edit: Stephan De Spiegeleire 2019-05-08
  • Chaomei Chen

    Chaomei Chen - 2019-05-08

    Thanks for various feedback, Stephan. I am going to travel soon. Let me take care of it afterwards.

     
  • Stephan De Spiegeleire

    Update (confirmed by a number of our team members) - the new version now doesn't process bigger projects anymore.

     
    • Chaomei Chen

      Chaomei Chen - 2019-05-17

      I just uploaded a new version. Do you want to try it?

       
      • Stephan De Spiegeleire

        Yes - we seem to be back in business. Thanks!

         

Log in to post a comment.

Want the latest updates on software, tech news, and AI?
Get latest updates about software, tech news, and AI from SourceForge directly in your inbox once a month.