Lundmark
2008-12-22, 05:48
I have become interested in making a computer that is resistant to the normal causes of service failure and downtime.
most common sources of downtime:
Software: A process runs off consuming all processing power or system memory, or the computer becomes unresponsive for some other reason.
Hardware: Dust clogs heatsinks causing the components to heat up until failure. RAM goes bad. Harddrive bearings wear out or the hard drive controller board fries. CPU fan bearings wear out. Capacitors burst or leak. Solder points fracture
Power: power going into the computer is cut or surges. Power supply components fail.
Here is what I propose:
A hermetically sealed computer case made of solid 1/4 steel on the motherboard base, and 1 mm steel anywhere else. The motherboard will lay horizontally to remove the stress of the weight on the heatsink on the motherboard.
If the computer hardware doesnt have a problem, the case will be filled with an inert gas to prevent corrosion. There will be no fans inside the case. Since there is still a need for air circulation in the case, an ion fan would be used to circulate air to the device used to exchange heat between the inside of the case and the outside, if the risk of electrostatic damage to the components can be eliminated.
A peltier element would be ideal for removing heat from the inside of the case, but I think those are prone to failure. If heat is an issue an element can be added to the outside of the case.
I think a pico itx or nano itx motherboard would be best for this. These boards have very low power consumption, which means they will run longer on backup power, and they will produce less heat. They physically weigh less so they will be less susceptibal to stress related to vibration.
I cant decide what type of physical storage would be best for this type of computer. I have an old quantum fireball from '97 that still works fine despite a few years of abuse early in its life related to a windows 98 page file. Magnetic hard drives are tried and tested to perform in the crap conditions put on them by 14 year old girls with file sharing programs, but their life span cant be judged by how the drive was used. Google did a study on the failure rate of hard drives and iirc it didnt seem to matter much if the drives were thashing back and forth or idle, if the drive survived the first six months of operation, they would run for several years or more. So hard drives are sturdy devices, but they also are not. Hard drive failures are probably the third most common point of failure in my experience. The thing has moving parts, and moving parts fail eventually. My next idea was to use a solid state drive made up of a good quality usb flash drive, but I dont know how reliable a flash drive would be used like this. The technology is still not proven to be reliable. They deteriorate according to how many times you write data to them for damns sake. Is it possible to have a raid array of usb drives? Does running an os mean the disk will have many write cycles? I also heard it is possible to mount an image of the os on a ram disk, and boot from that. I dont know how it was done, but that sounds much more reliable than solid state drives or a regular old platter disk.
Now for what the fuck software to run. I googled "longest uptime" and came to the netcraft site showing FreeBSD has the top spot on the list. Another site showed OpenVMS (which is closed source) as having the highest up time. Then I found a post on message board by a guy saying bsd dominates the netcraft site because the people running it dont have a reason to reboot, unlike linux users who are upgrading and patching to suit their needs, so there is nothing inherently stable about bsd. Most operating systems including windows variants would not crash if they were running on good hardware is another thing I read today. Any thoughts on what the best operating system to use would be?
Related story:
http://www.theregister.co.uk/2001/04/12/missing_novell_server_discovered_after/
most common sources of downtime:
Software: A process runs off consuming all processing power or system memory, or the computer becomes unresponsive for some other reason.
Hardware: Dust clogs heatsinks causing the components to heat up until failure. RAM goes bad. Harddrive bearings wear out or the hard drive controller board fries. CPU fan bearings wear out. Capacitors burst or leak. Solder points fracture
Power: power going into the computer is cut or surges. Power supply components fail.
Here is what I propose:
A hermetically sealed computer case made of solid 1/4 steel on the motherboard base, and 1 mm steel anywhere else. The motherboard will lay horizontally to remove the stress of the weight on the heatsink on the motherboard.
If the computer hardware doesnt have a problem, the case will be filled with an inert gas to prevent corrosion. There will be no fans inside the case. Since there is still a need for air circulation in the case, an ion fan would be used to circulate air to the device used to exchange heat between the inside of the case and the outside, if the risk of electrostatic damage to the components can be eliminated.
A peltier element would be ideal for removing heat from the inside of the case, but I think those are prone to failure. If heat is an issue an element can be added to the outside of the case.
I think a pico itx or nano itx motherboard would be best for this. These boards have very low power consumption, which means they will run longer on backup power, and they will produce less heat. They physically weigh less so they will be less susceptibal to stress related to vibration.
I cant decide what type of physical storage would be best for this type of computer. I have an old quantum fireball from '97 that still works fine despite a few years of abuse early in its life related to a windows 98 page file. Magnetic hard drives are tried and tested to perform in the crap conditions put on them by 14 year old girls with file sharing programs, but their life span cant be judged by how the drive was used. Google did a study on the failure rate of hard drives and iirc it didnt seem to matter much if the drives were thashing back and forth or idle, if the drive survived the first six months of operation, they would run for several years or more. So hard drives are sturdy devices, but they also are not. Hard drive failures are probably the third most common point of failure in my experience. The thing has moving parts, and moving parts fail eventually. My next idea was to use a solid state drive made up of a good quality usb flash drive, but I dont know how reliable a flash drive would be used like this. The technology is still not proven to be reliable. They deteriorate according to how many times you write data to them for damns sake. Is it possible to have a raid array of usb drives? Does running an os mean the disk will have many write cycles? I also heard it is possible to mount an image of the os on a ram disk, and boot from that. I dont know how it was done, but that sounds much more reliable than solid state drives or a regular old platter disk.
Now for what the fuck software to run. I googled "longest uptime" and came to the netcraft site showing FreeBSD has the top spot on the list. Another site showed OpenVMS (which is closed source) as having the highest up time. Then I found a post on message board by a guy saying bsd dominates the netcraft site because the people running it dont have a reason to reboot, unlike linux users who are upgrading and patching to suit their needs, so there is nothing inherently stable about bsd. Most operating systems including windows variants would not crash if they were running on good hardware is another thing I read today. Any thoughts on what the best operating system to use would be?
Related story:
http://www.theregister.co.uk/2001/04/12/missing_novell_server_discovered_after/