We've got a site with about a dozen Win 2k3 servers running NC_Net with no problem - but one of them's just started falling over as soon as it's started.
Start the service, and it responds correctly - briefly. Then DW20 kicks in, 100% CPU for a minute, and NC_Net service stops.
EventVwr is showing these errors :-
---
Event Type: Warning
Event Source: NC_Net
Event Category: None
Event ID: 9002
Date: 24/01/2007
Time: 10:03:13
User: N/A
Computer: <Host Name>
Description:
Failed to create instance mypc1:Input string was not in a correct format.
For more information, see Help and Support Center at http://go.microsoft.com/fwlink/events.asp.
---
Event Type: Warning
Event Source: NC_Net
Event Category: None
Event ID: 9003
Date: 24/01/2007
Time: 10:03:14
User: N/A
Computer: <Host Name>
Description:
Failed to create instance mypc2:Input string was not in a correct format.
For more information, see Help and Support Center at http://go.microsoft.com/fwlink/events.asp.
---
Event Type: Warning
Event Source: NC_Net
Event Category: None
Event ID: 9004
Date: 24/01/2007
Time: 10:03:15
User: N/A
Computer: <Host Name>
Description:
Failed to create instance mypc3:Input string was not in a correct format.
For more information, see Help and Support Center at http://go.microsoft.com/fwlink/events.asp.
---
Event Type: Warning
Event Source: NC_Net
Event Category: None
Event ID: 9005
Date: 24/01/2007
Time: 10:03:16
User: N/A
Computer: <Host Name>
Description:
Failed to create instance mypc4:Input string was not in a correct format.
For more information, see Help and Support Center at http://go.microsoft.com/fwlink/events.asp.
---
Event Type: Information
Event Source: NC_Net
Event Category: None
Event ID: 1004
Date: 24/01/2007
Time: 10:03:16
User: N/A
Computer: <Host Name>
Description:
NC_Net Service Started :
Date: 24/01/2007 10:03:16;
I've repaired the dotNet install, no difference.
I've tried to repair the NC_Net install, it fails - "The specified service already exists" (No, really...?)
I've tried to uninstall the NC_Net install, it fails - "An exception occurred while uninstalling. This exception will be ignored and the uninstall will continue. However, the application might not be fully inunstalled after the uninstall is complete --> The savedState dictionary contains inconsistent data and might have been corrupted."
Nagios' checks haven't been changed, and are the exact same checks running on a bunch of other servers with no problems. <touches wood quickly>
Any thoughts...? Everything else on the server (DNS/DHCP/AD domain controller) are all working fine.
If you would like to refer to this comment somewhere else in this project, copy and paste the following link:
Just manually uninstalled NC_Net - deleted the Program Files folder, gone through the registry and removed every reference (apart from the various HK-L-M/System/<each of the control sets>/enum/root/LEGACY_NC_NET, which wouldn't delete), restarted, reinstalled (smooth)... and it still does exactly the same.
At least this time, it did de-install (from Control Panel/Add-Remove), restart, reinstall.
STILL the same...
Would setting fire to the server help, d'you think...?
If you would like to refer to this comment somewhere else in this project, copy and paste the following link:
Reinstalling rarely fixes NC_NEt issues (unless it is upgrading versions).
Odds are this issue may be due to the Windows Application log settings,
please check the properties of the Application log, make sure it is not full and setup to overwrite as needed. and then clear the application log. then try starting NC_NEt.
Note, I have had issues with the repair option of the Installer. It has never helped on issues, and whenever I have tried it I also needed to uninstall via the Registry.
please contact me at NC_net@montitech.com to follow up on this, then once a solution is found it can be posted.
TOny
If you would like to refer to this comment somewhere else in this project, copy and paste the following link:
We had a test/dev XP box fill the App log, and it didn't produce any of this Event Viewer info. 2k3 has the logs set by default to rotate as needed - so that's definitely not the problem here.
If you would like to refer to this comment somewhere else in this project, copy and paste the following link:
It looks like this is NC_Net merely being a symptom - the server seems to have corrupted it's performance monitor counters. Opening PerfMon doesn't give the usual list of categories and counters, merely a list of numbers. NC_Net is failing to read the counters it requires, and shutting down.
Thanks to Tony for his assistance in diagnosis.
If you would like to refer to this comment somewhere else in this project, copy and paste the following link:
Sorry to resurrect this old thread, but I was having this problem and it was painful to track down.
Here's the MS KB article. http://support.microsoft.com/kb/300956
On win2k3, cd to "windows\system32" and run "lodctr /R" to rebuild the counters.
If you would like to refer to this comment somewhere else in this project, copy and paste the following link:
Thank you for the post I have been aware of the source being external to NC_NEt where the Counters on the server were corupted however on prior research I had not found a best easy to implement solution for repairing the performance counters.
Thank you for this contribution.
RE: NC_Net crashing on one server (New)
By: James McPhee (jmcphee) - 2008-01-31 18:57
Sorry to resurrect this old thread, but I was having this problem and it was painful to track down.
Here's the MS KB article. http://support.microsoft.com/kb/300956
On win2k3, cd to "windows\system32" and run "lodctr /R" to rebuild the counters.
If you would like to refer to this comment somewhere else in this project, copy and paste the following link:
We've got a site with about a dozen Win 2k3 servers running NC_Net with no problem - but one of them's just started falling over as soon as it's started.
Start the service, and it responds correctly - briefly. Then DW20 kicks in, 100% CPU for a minute, and NC_Net service stops.
EventVwr is showing these errors :-
---
Event Type: Warning
Event Source: NC_Net
Event Category: None
Event ID: 9002
Date: 24/01/2007
Time: 10:03:13
User: N/A
Computer: <Host Name>
Description:
Failed to create instance mypc1:Input string was not in a correct format.
For more information, see Help and Support Center at http://go.microsoft.com/fwlink/events.asp.
---
Event Type: Warning
Event Source: NC_Net
Event Category: None
Event ID: 9003
Date: 24/01/2007
Time: 10:03:14
User: N/A
Computer: <Host Name>
Description:
Failed to create instance mypc2:Input string was not in a correct format.
For more information, see Help and Support Center at http://go.microsoft.com/fwlink/events.asp.
---
Event Type: Warning
Event Source: NC_Net
Event Category: None
Event ID: 9004
Date: 24/01/2007
Time: 10:03:15
User: N/A
Computer: <Host Name>
Description:
Failed to create instance mypc3:Input string was not in a correct format.
For more information, see Help and Support Center at http://go.microsoft.com/fwlink/events.asp.
---
Event Type: Warning
Event Source: NC_Net
Event Category: None
Event ID: 9005
Date: 24/01/2007
Time: 10:03:16
User: N/A
Computer: <Host Name>
Description:
Failed to create instance mypc4:Input string was not in a correct format.
For more information, see Help and Support Center at http://go.microsoft.com/fwlink/events.asp.
---
Event Type: Information
Event Source: NC_Net
Event Category: None
Event ID: 1004
Date: 24/01/2007
Time: 10:03:16
User: N/A
Computer: <Host Name>
Description:
NC_Net Service Started :
Date: 24/01/2007 10:03:16;
Version: NC_Net 3.05 03/10/06;
Script Path:C:\Program Files\Montitech\NC_Net\script\;
NC_Net Config Path: C:\Program Files\Montitech\NC_Net\config\;
Startup Config: C:\Program Files\Montitech\NC_Net\config\startup.cfg;
Passive Config: C:\Program Files\Montitech\NC_Net\config\passive.cfg;
Passive Check Log:C:\Program Files\Montitech\NC_Net\config\passive.log;
Active_check: True;
Lock_active_config: True;
Port: 1248;
Pass: None;
Active_timeout: 20;
Active_ip_accept_list: False;
Passive_check: False;
Lock_passive_config: False;
Port_passive: 5667;
Host_passive: "NC_Net_host_ID";
Ip_passive: "127.0.0.1";
Pass_passive: ;
Encrip_passive: 1;
Interval_passive: 5;
Interval_div_passive: 1;
Passive_alwayson:False;
Passive_timeout:10;
Cpu_max_interval: 60;
cpu_single: False;
perfdata_format: 2;
Testrun: False;
Embedded_send_nsca: True;
External_send_nsca: False;
External_send_nsca_app: ;
External_send_nsca_cfg: ;
External_send_nsca_ip: ;
External_send_nsca_port: 5667;
External_send_nsca_timeout: 10;
allow_run_scripts: False;
script_timeout: 30;
do_not_blaim_nc_net: False
For more information, see Help and Support Center at http://go.microsoft.com/fwlink/events.asp.
---
Event Type: Error
Event Source: .NET Runtime 2.0 Error Reporting
Event Category: None
Event ID: 5000
Date: 24/01/2007
Time: 10:03:20
User: N/A
Computer: <Host Name>
Description:
EventType clr20r3, P1 nc_net.exe, P2 0.3.2260.22905, P3 4411bac3, P4 mscorlib, P5 2.0.0.0, P6 4333ab80, P7 bd0, P8 59, P9 system.formatexception, P10 NIL.
For more information, see Help and Support Center at http://go.microsoft.com/fwlink/events.asp.
Data:
0000: 63 00 6c 00 72 00 32 00 c.l.r.2.
0008: 30 00 72 00 33 00 2c 00 0.r.3.,.
0010: 20 00 6e 00 63 00 5f 00 .n.c._.
0018: 6e 00 65 00 74 00 2e 00 n.e.t...
0020: 65 00 78 00 65 00 2c 00 e.x.e.,.
0028: 20 00 30 00 2e 00 33 00 .0...3.
0030: 2e 00 32 00 32 00 36 00 ..2.2.6.
0038: 30 00 2e 00 32 00 32 00 0...2.2.
0040: 39 00 30 00 35 00 2c 00 9.0.5.,.
0048: 20 00 34 00 34 00 31 00 .4.4.1.
0050: 31 00 62 00 61 00 63 00 1.b.a.c.
0058: 33 00 2c 00 20 00 6d 00 3.,. .m.
0060: 73 00 63 00 6f 00 72 00 s.c.o.r.
0068: 6c 00 69 00 62 00 2c 00 l.i.b.,.
0070: 20 00 32 00 2e 00 30 00 .2...0.
0078: 2e 00 30 00 2e 00 30 00 ..0...0.
0080: 2c 00 20 00 34 00 33 00 ,. .4.3.
0088: 33 00 33 00 61 00 62 00 3.3.a.b.
0090: 38 00 30 00 2c 00 20 00 8.0.,. .
0098: 62 00 64 00 30 00 2c 00 b.d.0.,.
00a0: 20 00 35 00 39 00 2c 00 .5.9.,.
00a8: 20 00 73 00 79 00 73 00 .s.y.s.
00b0: 74 00 65 00 6d 00 2e 00 t.e.m...
00b8: 66 00 6f 00 72 00 6d 00 f.o.r.m.
00c0: 61 00 74 00 65 00 78 00 a.t.e.x.
00c8: 63 00 65 00 70 00 74 00 c.e.p.t.
00d0: 69 00 6f 00 6e 00 20 00 i.o.n. .
00d8: 4e 00 49 00 4c 00 0d 00 N.I.L...
00e0: 0a 00 ..
---
I've repaired the dotNet install, no difference.
I've tried to repair the NC_Net install, it fails - "The specified service already exists" (No, really...?)
I've tried to uninstall the NC_Net install, it fails - "An exception occurred while uninstalling. This exception will be ignored and the uninstall will continue. However, the application might not be fully inunstalled after the uninstall is complete --> The savedState dictionary contains inconsistent data and might have been corrupted."
Nagios' checks haven't been changed, and are the exact same checks running on a bunch of other servers with no problems. <touches wood quickly>
Any thoughts...? Everything else on the server (DNS/DHCP/AD domain controller) are all working fine.
Just manually uninstalled NC_Net - deleted the Program Files folder, gone through the registry and removed every reference (apart from the various HK-L-M/System/<each of the control sets>/enum/root/LEGACY_NC_NET, which wouldn't delete), restarted, reinstalled (smooth)... and it still does exactly the same.
At least this time, it did de-install (from Control Panel/Add-Remove), restart, reinstall.
STILL the same...
Would setting fire to the server help, d'you think...?
hi,
Reinstalling rarely fixes NC_NEt issues (unless it is upgrading versions).
Odds are this issue may be due to the Windows Application log settings,
please check the properties of the Application log, make sure it is not full and setup to overwrite as needed. and then clear the application log. then try starting NC_NEt.
Note, I have had issues with the repair option of the Installer. It has never helped on issues, and whenever I have tried it I also needed to uninstall via the Registry.
please contact me at NC_net@montitech.com to follow up on this, then once a solution is found it can be posted.
TOny
Tony,
eMail sent.
We had a test/dev XP box fill the App log, and it didn't produce any of this Event Viewer info. 2k3 has the logs set by default to rotate as needed - so that's definitely not the problem here.
Update :-
It looks like this is NC_Net merely being a symptom - the server seems to have corrupted it's performance monitor counters. Opening PerfMon doesn't give the usual list of categories and counters, merely a list of numbers. NC_Net is failing to read the counters it requires, and shutting down.
Thanks to Tony for his assistance in diagnosis.
I had the same problem. Thanks for the pointer!
Sorry to resurrect this old thread, but I was having this problem and it was painful to track down.
Here's the MS KB article.
http://support.microsoft.com/kb/300956
On win2k3, cd to "windows\system32" and run "lodctr /R" to rebuild the counters.
Thank you for the post I have been aware of the source being external to NC_NEt where the Counters on the server were corupted however on prior research I had not found a best easy to implement solution for repairing the performance counters.
Thank you for this contribution.
RE: NC_Net crashing on one server (New)
By: James McPhee (jmcphee) - 2008-01-31 18:57
Sorry to resurrect this old thread, but I was having this problem and it was painful to track down.
Here's the MS KB article.
http://support.microsoft.com/kb/300956
On win2k3, cd to "windows\system32" and run "lodctr /R" to rebuild the counters.