Preventing data center hot spots with temperature monitoring


Cooling failures and overheated servers have even worse consequences than power failures in most mission-critical data centers. A well-maintained uninterruptible power supply should keep servers operating until generators kick in, power is restored or an orderly shutdown occurs. But in today's world of high-density hardware and elevated operating temperatures, a cooling failure -- even with supposedly redundant air conditioners -- can cause server crashes in seconds. Use data center temperature monitoring to avoid data center hot spots that lead to early hardware failures and unexplainable data errors.

How data center hot spots occur

Hot spots are insidious; they can creep up on you unnoticed until equipment starts to fail or strange data anomalies appear. If you add or move equipment around without real knowledge of the room's cooling capacities, hot spots can occur. In nearly every data center, cooling capacities vary at different locations in the room and at different positions along the rack height. Since hot spots usually develop slowly, they can easily go unrecognized until they become serious.

Find hot spots through data center temperature monitoring

The easiest and least expensive way to find data center hot spots is with temperature-indicating blanking panels. The multi-colored strips on these panels are heat-sensitive, and provide a visual indication of inlet air temperatures. Mount them near the top, middle and bottom of each rack, or at least in every other rack. Alternatively, mount temperature probes in front of hardware, close to the top, middle and bottom of racks. If you can only afford one per rack, put it in front of the most vulnerable hardware, which is usually the highest server in the rack.
Data center temperature and humidity probes are available as add-ons to smart rack power distribution units, as individual wireless devices and as part of some data center infrastructure management systems. All three offer software options that give real-time graphical displays of temperature conditions throughout the room. Ultimately, you should combine these readouts with computational fluid dynamics (CFD) air flow modeling, which allows you to verify cooling adequacy by simulating the proposed new installation before equipment is even installed.
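
As a rough illustration of what monitoring software does with the probe readings described above, the sketch below flags inlet temperatures that exceed two thresholds. The 32 degrees Celsius limit is the allowable figure quoted in this article; the 27 degrees Celsius warning level, the rack names and the readings are assumptions made up for the example, not values from any particular product.

```python
from dataclasses import dataclass

# Assumption: warning level for this example (a commonly cited recommended upper limit).
RECOMMENDED_MAX_C = 27.0
# Allowable limit quoted in this article.
ALLOWABLE_MAX_C = 32.0


@dataclass
class InletReading:
    rack: str        # hypothetical rack label
    position: str    # "top", "middle" or "bottom"
    temp_c: float    # inlet air temperature in degrees Celsius


def classify(temp_c: float) -> str:
    """Map an inlet temperature to a simple status string."""
    if temp_c > ALLOWABLE_MAX_C:
        return "critical"
    if temp_c > RECOMMENDED_MAX_C:
        return "warning"
    return "ok"


def find_hot_spots(readings):
    """Return (rack, position, temp_c, status) for readings that are not ok, hottest first."""
    flagged = [
        (r.rack, r.position, r.temp_c, classify(r.temp_c))
        for r in readings
        if classify(r.temp_c) != "ok"
    ]
    return sorted(flagged, key=lambda item: item[2], reverse=True)


# Made-up sample data: probes at different heights in two racks.
readings = [
    InletReading("A01", "bottom", 21.5),
    InletReading("A01", "middle", 24.0),
    InletReading("A01", "top", 29.5),
    InletReading("A02", "top", 33.0),
]

for rack, position, temp_c, status in find_hot_spots(readings):
    print(f"{rack} {position}: {temp_c:.1f} C ({status})")
```

Sorting hottest first puts the racks that need attention at the head of the list; a real system would also track trends over time, since hot spots tend to develop slowly.
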
Many data centers invest in redundant cooling units but don't actually have redundant cooling; sometimes it's just poor design. Some computer room air conditioning units are installed with insufficient knowledge of how air really moves in a data center, causing even worse cooling conditions. In modern designs, redundant units run simultaneously with normal units, but at reduced speed, so you don't realize added servers are stealing redundant capacity until a cooling unit fails or is turned off for maintenance.
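
A first sanity check on whether added servers have eaten into redundant capacity can be sketched as simple arithmetic: remove the largest cooling unit and see whether the remaining units still cover the heat load. The function names, unit sizes and loads below are hypothetical, and the check looks at capacity only; it says nothing about how the air is actually distributed in the room.

```python
def surviving_capacity_kw(unit_capacities_kw, failed=1):
    """Cooling capacity left if the `failed` largest units stop (worst case)."""
    ordered = sorted(unit_capacities_kw, reverse=True)
    return sum(ordered[failed:])


def has_redundancy(unit_capacities_kw, heat_load_kw, failed=1):
    """True if the remaining units can still absorb the heat load."""
    return surviving_capacity_kw(unit_capacities_kw, failed) >= heat_load_kw


# Hypothetical room: three 30 kW cooling units.
units_kw = [30.0, 30.0, 30.0]

print(has_redundancy(units_kw, heat_load_kw=55.0))  # load before new servers were added
print(has_redundancy(units_kw, heat_load_kw=70.0))  # load after new servers were added
```

In this made-up case the room loses its redundancy once the load passes 60 kW, even though all three units together could still handle 70 kW while running.
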
Thankfully, servers can tolerate a higher operating temperature for several days with little negative effect. ASHRAE's allowable thermal envelope goes up to 32 degrees Celsius or 89.6 degrees Fahrenheit in emergencies, but marginal redundancy -- combined with poorly planned computing hardware additions -- can cause serious overheating and thermal shutdowns within a short time after a cooling unit has quit.

Prevent data center cooling failures

Some think a solution is placing redundant coolers next to normal coolers in a raised floor design, but that's not dependable. When air emanates from different locations, there will be some difference in air flow pattern when a normal or redundant unit is operating, or when both run together. This seemingly small difference causes data center temperature variations that can result in significant hot spots.

Three causes of hot spots

  • New equipment adds too much heat. This could be due to insufficient cooling capacity, inadequate air flow, or both, but it's an endemic problem and no amount of adjustment is going to solve it.
  • Hardware is installed too high in the rack in an under-floor air facility, particularly one without containment. Air delivered from the floor increases in temperature as it rises, so it is warmer than the design temperature by the time it reaches the upper equipment. Without containment, hot discharge air recirculates from the backs of the racks and over the tops, where it mixes with air that is already only marginally cool.
  • Organizations fail to adjust floor tiles in under-floor cooling environments, or discharge grilles in overhead designs, to supply the amount of air needed to cool the added load.
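
To get a feel for "the amount of air needed to cool the added load" in the last point, the sensible-heat relation (airflow = power / (air density x specific heat x temperature rise)) gives a quick estimate. The sketch below assumes standard air properties and a temperature rise of 11 K across the equipment; both are illustrative assumptions, and real hardware and room conditions differ.

```python
AIR_DENSITY_KG_M3 = 1.2         # assumption: air near sea level at room temperature
AIR_SPECIFIC_HEAT_J_KG_K = 1005.0  # assumption: dry air
CFM_PER_M3_S = 2118.88          # cubic feet per minute in one cubic metre per second


def airflow_m3_per_s(heat_load_w: float, delta_t_k: float) -> float:
    """Airflow needed to carry away heat_load_w with a temperature rise of delta_t_k."""
    if delta_t_k <= 0:
        raise ValueError("delta_t_k must be positive")
    return heat_load_w / (AIR_DENSITY_KG_M3 * AIR_SPECIFIC_HEAT_J_KG_K * delta_t_k)


def airflow_cfm(heat_load_w: float, delta_t_k: float) -> float:
    """Same estimate expressed in cubic feet per minute."""
    return airflow_m3_per_s(heat_load_w, delta_t_k) * CFM_PER_M3_S


# Hypothetical addition: 5 kW of new servers, assumed 11 K inlet-to-outlet rise.
added_load_w = 5000.0
assumed_rise_k = 11.0
print(f"{airflow_m3_per_s(added_load_w, assumed_rise_k):.3f} m3/s")
print(f"{airflow_cfm(added_load_w, assumed_rise_k):.0f} CFM")
```

Comparing a figure like this with what the nearby perforated tiles or grilles actually deliver shows whether the added load needs more air than its location supplies.
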

Thermal indicators are a good first step, but it's impractical to turn off cooling units every time hardware changes just to see what overheats. The best way to avoid problems, particularly in redundant designs, is to model the cooling with CFD, which creates a 3D model of the data center, including specific cooling systems and rack heat loads. The program uses this information to solve thousands of complex partial differential equations that form an analysis of the air flow. The model delivers both color-coded graphics and data tables showing air quantity, velocity, temperature and pressure at every point in the room, plus under-floor in raised floor installations. It is then easy to see where extra cooling capacity exists and add new equipment there. It's also easy to fail a cooling unit in the model, rerun the computations, and see how well the redundancy works.

source: https://searchdatacenter.techtarget.com/tip/Dont-get-burned-by-data-center-hot-spots
