Advertisement
adiee

Untitled

Jul 4th, 2021
75
0
Never
Not a member of Pastebin yet? Sign Up, it unlocks many cool features!
text 9.95 KB | None | 0 0
  1. Transcript
  2.  
  3.  
  4. 00:05
  5. [electronic music]
  6. 00:17
  7. Sandeep: Hi. My name is Sandeep,
  8. 00:19
  9. a developer advocate on the Google Cloud platform.
  10. 00:22
  11. Welcome to the Google Data Center
  12. 00:23
  13. at the Dalles, Oregon.
  14. 00:25
  15. Take a look around.
  16. 00:27
  17. Before we go inside, we need to make sure
  18. 00:29
  19. that we have the appropriate security clearance.
  20. 00:31
  21. Most Google employees can't even get in here.
  22. 00:34
  23. So let's go on a special behind-the-scenes tour.
  24. 00:39
  25. [keypad beeps, door opens]
  26. 00:44
  27. I'm here with Noah
  28. 00:46
  29. from the Site Reliability Engineering Team.
  30. 00:48
  31. Noah, can you tell us a little bit more
  32. 00:49
  33. about the SRE role at Google?
  34. 00:51
  35. Noah: Yeah, SREs write and maintain the software systems
  36. 00:53
  37. designed to keep our services running.
  38. 00:55
  39. Sandeep: So what happens if one of these systems goes down?
  40. 00:57
  41. Noah: We've designed our systems from the ground up
  42. 01:00
  43. to be able to handle any unexpected failures
  44. 01:02
  45. that might occur.
  46. 01:03
  47. We have highly redundant power, networking,
  48. 01:05
  49. and serving domains so that even if we do lose
  50. 01:07
  51. an entire cluster, we're able to re-direct those workloads
  52. 01:10
  53. and live migrate data in order to minimize any impact.
  54. 01:13
  55. In addition, we have a team of SREs on call 24/7
  56. 01:17
  57. that can tackle any problems that might arise.
  58. 01:20
  59. Sandeep: Thanks, Noah.
  60. 01:21
  61. Now we've learned more about the systems
  62. 01:23
  63. that manage our fleet at Google,
  64. 01:24
  65. let's take a deeper look
  66. 01:25
  67. at the data center infrastructure itself.
  68. 01:29
  69. Before we can continue further,
  70. 01:31
  71. we need to go through the biometric iris scan
  72. 01:33
  73. and circle lock.
  74. 01:34
  75. These only allow one person in at a time
  76. 01:37
  77. and require dual authentication
  78. 01:38
  79. to continue further.
  80. 01:40
  81. I'll see you on the other side.
  82. 01:44
  83. [control beeps]
  84. 01:45
  85. computer voice: Please situate your eyes
  86. 01:47
  87. to begin the procedure.
  88. 01:52
  89. Please come a little closer to the camera.
  90. 01:55
  91. [beep]
  92. 01:56
  93. Sandeep: Welcome to the data center floor.
  94. 01:59
  95. As you can tell, we have a lot of servers,
  96. 02:01
  97. and this is a single cluster in a single floor
  98. 02:04
  99. in a single building.
  100. 02:06
  101. Managing all of these servers on a global scale
  102. 02:08
  103. is quite a challenge.
  104. 02:11
  105. To utilize our fleet, we use tools
  106. 02:13
  107. such as Borg, Colossus, and Spanner.
  108. 02:16
  109. You may be familiar with similar tools,
  110. 02:18
  111. such as Kubernetes, Google Cloud storage,
  112. 02:20
  113. and BigQuery.
  114. 02:22
  115. These tools allow Google engineers
  116. 02:24
  117. and Cloud customers
  118. 02:25
  119. to more easily manage infrastructure,
  120. 02:27
  121. allowing everyone to build
  122. 02:28
  123. innovative and scalable applications.
  124. 02:36
  125. Here at Google, a lot of our infrastructure is custom-made.
  126. 02:39
  127. This gives us the flexibility and performance
  128. 02:42
  129. we need to run all of our services at scale.
  130. 02:44
  131. Oh, hey, it's Virginia, one of our network engineers.
  132. 02:46
  133. Virginia: Hey, Sandeep.
  134. 02:47
  135. Sandeep: Virginia, what are you working on today?
  136. 02:49
  137. Virginia: Today I'm working with Hardware Ops
  138. 02:50
  139. to expand this data center network
  140. 02:52
  141. to deploy additional machines in this building.
  142. 02:54
  143. Our fleet is constantly growing to support new capacity
  144. 02:57
  145. for Google products and our Cloud customers.
  146. 03:00
  147. Sandeep: That sounds like a lot of work,
  148. 03:01
  149. to be constantly adding capacity around the globe.
  150. 03:03
  151. Virginia: Well, we designed our network
  152. 03:05
  153. so that this kind of capacity growth isn't very hard.
  154. 03:07
  155. Jupiter, our current data center and network technology,
  156. 03:10
  157. is a hierarchical design
  158. 03:12
  159. using software-defined networking principles.
  160. 03:14
  161. So just like with our servers,
  162. 03:16
  163. we abstracted away the specific details of our network
  164. 03:19
  165. and can manage them like they're software programs and data.
  166. 03:22
  167. Sandeep: Abstracting seems to be a common theme here at Google.
  168. 03:25
  169. I've also noticed there's a lot of fiber
  170. 03:26
  171. running in our data centers. Virginia: That's right.
  172. 03:28
  173. A single building can support 75,000 machines,
  174. 03:31
  175. and carry over one petabit per second of bandwidth,
  176. 03:34
  177. which is actually more than on the entire Internet.
  178. 03:37
  179. Sandeep: Wow. Virginia: This allows us
  180. 03:38
  181. to reliably access storage and compute resources
  182. 03:41
  183. with low latency and high throughput.
  184. 03:43
  185. Sandeep: So how is this data center connected to
  186. 03:45
  187. all our other data centers around the globe?
  188. 03:47
  189. Virginia: Google runs B4,
  190. 03:49
  191. our own private, highly efficient backbone network,
  192. 03:52
  193. which is actually growing faster
  194. 03:53
  195. than our Internet-facing network.
  196. 03:55
  197. It connects all our data centers together
  198. 03:57
  199. and allows services to efficiently access resources
  200. 04:00
  201. in any location. Sandeep: Nice.
  202. 04:02
  203. I finally know what all this Google fiber is really used for.
  204. 04:04
  205. Thanks, Virginia. Virginia: No problem.
  206. 04:07
  207. Sandeep: So now you've seen
  208. 04:08
  209. all the compute and networking horsepower
  210. 04:10
  211. required to run your workloads in the Cloud,
  212. 04:12
  213. let's take a look at where your data is
  214. 04:14
  215. safely and securely stored.
  216. 04:16
  217. Let's go.
  218. 04:19
  219. Whether you're querying terabytes of data on BigQuery
  220. 04:22
  221. or storing petabytes in Google Cloud Storage,
  222. 04:25
  223. all of your data needs to be stored on a physical device.
  224. 04:28
  225. Our data center infrastructure allows us
  226. 04:30
  227. to access our storage quickly and securely.
  228. 04:33
  229. At our scale, we need to handle hard drive and SSD failure
  230. 04:37
  231. on a daily basis.
  232. 04:38
  233. While your data is replicated and safe,
  234. 04:40
  235. we need to destroy or recycle used hard drives
  236. 04:43
  237. so no one can access your data.
  238. 04:45
  239. From the time a disc is removed from the server
  240. 04:47
  241. to the time it's decommissioned,
  242. 04:49
  243. we maintain a very strict chain of custody.
  244. 04:52
  245. The discs are completely wiped and then destroyed
  246. 04:54
  247. in a huge shredder.
  248. 04:56
  249. Let's go shred some hard drives.
  250. 05:06
  251. [beeping]
  252. 05:17
  253. We've looked at a lot of the hardware
  254. 05:19
  255. that runs in our data centers, but it doesn't end there.
  256. 05:22
  257. We need to cool and power our infrastructure
  258. 05:25
  259. in an environmentally sustainable and reliable way.
  260. 05:28
  261. Let's take a look at how we cool our servers.
  262. 05:33
  263. Welcome to the mechanical equipment room.
  264. 05:35
  265. Looks pretty cool, doesn't it?
  266. 05:38
  267. Oh, hey, it's Brian, one of
  268. 05:39
  269. our data center facilities technicians!
  270. 05:40
  271. Brian: Hey, Sandeep. Sandeep: Hey, Brian.
  272. 05:42
  273. Brian, can you tell us a little bit more about this room?
  274. 05:44
  275. Brian: Sure. This is a cooling plant
  276. 05:46
  277. for one of the data centers that we have on site.
  278. 05:49
  279. So a lot of heat is generated on the server floor,
  280. 05:51
  281. and it all has to be removed,
  282. 05:53
  283. and that starts right here in the cooling plant.
  284. 05:55
  285. So it's basically two loops.
  286. 05:57
  287. We have the condenser water loop
  288. 05:58
  289. and we have the process water loop.
  290. 06:00
  291. The process water loop are these blue and red pipes over here.
  292. 06:03
  293. So they take the heat off the server floor
  294. 06:05
  295. and they transfer it to these heat exchangers here.
  296. 06:08
  297. The condenser water loop are
  298. 06:09
  299. these green and yellow pipes here.
  300. 06:11
  301. They take the cold water from the basin underneath us,
  302. 06:14
  303. they transfer it to these heat exchangers here,
  304. 06:16
  305. and they send it up to the cooling towers up on the roof.
  306. 06:18
  307. Sandeep: I notice our pipes are Google colors.
  308. 06:20
  309. It's pretty cool.
  310. 06:22
  311. So how efficient is our data center?
  312. 06:24
  313. Brian: Well, Google has some of
  314. 06:25
  315. the most efficient data centers in the world.
  316. 06:27
  317. In fact, when we started reporting our power usage effectiveness
  318. 06:30
  319. or P.U.E., in 2008,
  320. 06:32
  321. most data centers were around 100% overhead.
  322. 06:36
  323. At that point in time, Google was 20% overhead,
  324. 06:38
  325. but since then, we've reduced it to just 12%,
  326. 06:41
  327. and that even includes our cafeterias.
  328. 06:43
  329. Sandeep: Whoa! That is so low!
  330. 06:46
  331. Also what's this big green machine for?
  332. 06:48
  333. Brian: Oh, well, this is a chiller.
  334. 06:50
  335. We very rarely use them,
  336. 06:51
  337. but it helps keep the process water temperature
  338. 06:53
  339. in the desired temperature range
  340. 06:55
  341. when it gets really hot outside,
  342. 06:57
  343. basically helping the cooling tower do its job,
  344. 06:59
  345. and some of our newer data centers,
  346. 07:01
  347. they have no chillers at all.
  348. 07:03
  349. Sandeep: I love how our new data centers are even more efficient.
  350. 07:06
  351. By the way, can we go up and take a look at a cooling tower?
  352. 07:09
  353. Brian: Sure. Let's go.
  354. 07:13
  355. Sandeep: Wow, what a view up here!
  356. 07:16
  357. Brian: So, Sandeep, this is a cooling tower.
  358. 07:18
  359. It uses evaporation to rapidly cool the water
  360. 07:20
  361. from the condenser loop and sends it back down to the basin.
  362. 07:23
  363. You could say we're making actual clouds with the Cloud.
  364. 07:26
  365. Sandeep: Clouds making actual clouds--welcome to Google!
  366. 07:29
  367. So, Brian, how do we power the Cloud?
  368. 07:31
  369. Brian: Well, that all starts at Google's power substation.
  370. 07:33
  371. Let's go take a look.
  372. 07:38
  373. So this is the Google-owned power substation.
  374. 07:41
  375. This is where the high voltage power enters the site.
  376. 07:43
  377. It's reduced and then sent
  378. 07:45
  379. to multiple power distribution centers
  380. 07:46
  381. such as this one right here.
  382. 07:48
  383. Sandeep: What happens if a power distribution center
  384. 07:49
  385. loses power?
  386. 07:50
  387. Brian: If it loses power, we have
  388. 07:52
  389. multiple generator and utility backup sources
  390. 07:54
  391. available to maintain power to those servers.
  392. 07:56
  393. Sandeep: And where does all the power come from?
  394. 07:58
  395. Brian: It actually comes from
  396. 07:59
  397. multiple hydroelectric power plants
  398. 08:01
  399. that are nearby. Sandeep: I love how Google uses
  400. 08:03
  401. reliable green energy whenever possible.
  402. 08:06
  403. Brian: We are 100% carbon neutral actually.
  404. 08:08
  405. Sandeep: That's pretty cool
  406. 08:09
  407. You know, it seems like Google builds reliability
  408. 08:11
  409. from the ground up, from the power and cooling
  410. 08:13
  411. all the way to the software systems that manage our fleet.
  412. 08:16
  413. Thanks for showing me around, Brian.
  414. 08:17
  415. Brian: No problem. Have a great day.
  416. 08:19
  417. Sandeep: Thank you for joining me on
  418. 08:20
  419. this special behind-the-scenes tour.
  420. 08:22
  421. Please check out cloud.google.com
  422. 08:24
  423. to learn how you can build what's next.
Advertisement
Add Comment
Please, Sign In to add comment
Advertisement