エンジニアのためのSRE論文への招待 / Introduction to SRE Papers for Engineers
Slides for a 20-minute talk at SRE NEXT 2023 IN TOKYO. https://sre-next.dev/2023/schedule/#jp029
The published deck includes every slide, including those skipped during the talk for lack of time.
Yuuki Tsubouchi (yuuk1)
September 30, 2023
Transcript
-
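2 Profile
Yuuki TSUBOUCHI (yuuk1)
・Senior Researcher, SAKURA internet Research Center
・Technology Advisor, Topotal
・Graduate School of Informatics, Kyoto University: doctoral program, withdrew after completing the course requirements
https://yuuk.io/ @yuuk1t
SRE NEXT speaking history
・2020, keynote: an overview of SRE and the SRE research I am working on
・2022, open-call session: AIOps research, automating root-cause diagnosis
-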
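6 Engineers and papers
・My blog post from nine years ago, "System papers for infrastructure engineers" ※1, introduced papers on operating systems, databases, and networking
・Since SRE became widespread, papers have started to mention SRE and to cite the SRE book
・Quite a few engineers are interested in reading papers, or already read them ※2
・Technical papers in particular are presumably written with practitioners in mind as readers
※1 y_uuki, "System papers for infrastructure engineers" (in Japanese), 2014 https://blog.yuuk.io/entry/system-papers.
※2 engineering-reading-papers.md, https://gist.github.com/yuuki/20f22bdd85a00630006b8dab6386881e
-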
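7 Broad classification of technical papers
※ The classification and terms are the presenter's own
Papers on already-adopted technology
・Papers proposing a technology that were published after the technology was deployed in practice
・Papers proposing a technology that was deployed in practice and became widespread after publication
・Engineers read these to understand the cloud services and OSS they use ※1
・Examples: Aurora, DynamoDB, Spanner, TiDB, CockroachDB, Firecracker, gVisor, Dapper, Gorilla's delta encoding, Monarch…
Papers on not-yet-adopted technology
・Papers proposing a technology that has not yet become widespread
※1 rrreeeyyy, "On the reliability of web services and the automation of operations" (in Japanese), invited talk, 40th IPSJ SIG Internet and Operation Technology meeting, 2018 https://speakerdeck.com/rrreeeyyy/iot40-rrreeeyyy.
-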
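8 Motivation for today's talk
How to find and read SRE papers is not widely known…
This talk introduces how to find and how to read papers, focusing on SRE papers about not-yet-adopted technology, for engineers who are looking for ideas.
Engineers who want to implement or apply new technology 🤝 ideas from papers on not-yet-adopted technology
⚠ Take care when the technique has already been patented ※1
※1 "Open source and algorithm patents" (in Japanese), https://ny23.hatenadiary.org/entry/20100701/p1
-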
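10 What is an academic paper?
※ The definition is the presenter's own
A document that serves as proof of having pushed outward the body of knowledge accumulated by humankind. It demonstrates the new knowledge that was stacked on top of existing knowledge.
Figure: excerpted and partly modified from "The illustrated guide to a Ph.D." Stages shown: elementary and junior high school, high school, university, master's program, doctoral program. Regions shown: what humankind already knows, what humankind does not yet know.
Reference: Atsuto Onoda, "Misconceptions and truths about doctoral programs: based on the material I used to persuade my parents before enrolling" (in Japanese), 2018. https://www.slideshare.net/atsutoonoda/ss-124873093.
-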
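11 Main categories of academic papers
※ Mainly the categories used in computer science
Thesis or dissertation: a paper submitted to a university or similar institution to obtain a degree
・Summarizes the author's research activity
・May stitch together the content of several published papers
Published paper: a paper that appears in a venue, typically run by an academic society (ACM, IEEE, etc.)
Journal paper: published in a journal
・Positioned as the place to publish the final version of a piece of research
Conference paper: presented orally at a conference
・In computer science, international conferences carry more weight than in other fields
Slide callout: the kind you actually read most often
Reference: "Differences between types of papers" (in Japanese), 2008 https://next49.hatenadiary.jp/entry/20080612/p2.
"Types of papers and their positioning" (in Japanese), https://wrc.sfc.keio.ac.jp/?p=129
-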
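12 Categories of academic papers by content (computer science)
Research paper
・Proposes a novel algorithm or system
・Classified by length into Full, Short, Poster, Position paper, and so on
Review or survey paper
・Surveys existing papers thoroughly, then classifies or comparatively evaluates them
・Titles often begin with "A Survey of/on …"
Industrial paper
・Puts the weight on practical problems, observations, and measurements of real-world systems
・Unfortunately there are not many of them
・Examples ※1,2
Slide callout: the kind you actually read most often
※1 MIDDLEWARE 2020 CALL FOR INDUSTRY PAPERS, 2022 https://middleware-conf.github.io/2022/call-for-industry-papers/.
※2 SIGMOD 2023 Call for Papers - Industrial Track https://2023.sigmod.org/calls_industrial_track_papers.shtml.
-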
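13 What is an "SRE paper"?
※ The term is the presenter's own
Scope of SRE papers in this talk (fields shown in the diagram):
・Software engineering: software quality, development, and maintainability
・System software: operating systems, distributed systems, language processors
・Computer networks
・Databases
・Storage
・Reliability engineering: analyzes reliability against failures of machines, buildings, and the like
・Cloud computing
・Site Reliability Engineering: SLI/SLO, Observability, incident management, …
Because SRE is an interdisciplinary area, drawing a clear boundary is difficult.
-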
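17 Paper search engines
Google Scholar
・Indexes new papers quickly and responds fast
・The alert feature described later is convenient
・Caveat: pages from the SREcon site show up in the results
Connected Papers ※1
・Makes it easy to visualize citation relationships between papers
・Features are limited on the free plan
Search keywords
・Terms such as "Observability" tend to return papers from other fields
・Include a magic word that other fields do not use (for example "Microservices")
・As a coarse filter, pick papers from ACM, IEEE, and USENIX
※1 Connected Papers, https://www.connectedpapers.com/.
-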
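The keyword advice on this slide can be sketched as a small query builder. This is only an illustration under stated assumptions: the function name `build_scholar_url`, the publisher domains, and the use of `site:` filters are choices made for this example, not something the talk prescribes, and Google Scholar may not honor every operator in the same way as web search.

```python
from urllib.parse import parse_qs, urlencode, urlsplit

# Assumed publisher domains used as a coarse filter (ACM, IEEE, USENIX).
PUBLISHER_SITES = ("dl.acm.org", "ieeexplore.ieee.org", "usenix.org")


def build_scholar_url(topic, magic_words, sites=PUBLISHER_SITES):
    """Build a Google Scholar search URL (hypothetical helper).

    topic:       an ambiguous term such as "Observability"
    magic_words: terms other fields rarely use, such as "Microservices"
    sites:       publisher domains used as a coarse filter
    """
    # Quote every term so it is searched as an exact phrase.
    terms = [f'"{topic}"'] + [f'"{word}"' for word in magic_words]
    site_filter = " OR ".join(f"site:{site}" for site in sites)
    query = " ".join(terms) + f" ({site_filter})"
    return "https://scholar.google.com/scholar?" + urlencode({"q": query})


url = build_scholar_url("Observability", ["Microservices"])
decoded_query = parse_qs(urlsplit(url).query)["q"][0]
print(url)
print(decoded_query)
```

Pairing the ambiguous term with a magic word narrows the results before any manual screening, and the publisher filter can be dropped when it hides too much.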
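18 Ways to find SRE papers
Papers that cite the SRE book
・Prioritizes relevance to SRE
・Relevance to SRE tends to be stronger
Making the rounds of international conferences
・Prioritizes novelty and quality
・Look at the programs of conferences whose Field in the earlier list includes any of:
- Software Engineering
- Reliability
- Cloud Computing
・Finding one or two papers per conference edition is enough
-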
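19 Bookmarking papers and getting notified
Paperpile ※1 is recommended for bookmarking
・Can save paper files to Google Drive with normalized file names
・Integrates with Google Scholar through a browser extension
Google Scholar Alert
・As you cultivate it, passive discovery becomes possible
・Can notify you by email
・When a paper you follow is cited by another paper
・When an author you follow publishes a new paper
※1 Paperpile, https://paperpile.com/
-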
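20 An example of an SRE paper
Hauer, et al., "Meaningful Availability", NSDI 2020.
Figure: reproduction of the first page of [Hauer+, NSDI2020]
・An availability metric used for Google's G Suite
・A not-yet-adopted technology: it has not been turned into a service or released as OSS
・An application at Pinterest was presented at SREcon21 ※1
※1 Anika Mukherji, User Uptime in Practice, SREcon, 2021.
-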
21 Other examples of SRE papers (1)
Using eBPF-derived metrics for container placement strategies and performance analysis
・Neves, et al., Black-box Inter-application Traffic Monitoring for Adaptive Container Placement, SAC, 2020.
・Amaral, et al., MicroLens: A Performance Analysis Framework for Microservices Using Hidden Metrics With BPF, CLOUD, 2022.
ML models that address the sampling problem in distributed tracing
・Huang, et al., Sieve: Attention-based Sampling of End-to-End Trace Data in Distributed Microservice Systems, ICWS, 2021.
・Las-Casas, et al., Sifter: Scalable Sampling for Distributed Traces, without Feature Engineering, SoCC, 2019.
Analysis of production incidents
・Wu, et al., An Empirical Study on Change-induced Incidents of Online Service Systems, ICSE 2023.
・Ghosh, et al., How to Fight Production Incidents? An Empirical Study on a Large-scale Cloud Service, SoCC 2022.
-
22 Other examples of SRE papers (2)
Root-cause diagnosis of failures and log analysis using LLMs
・Ahmed, et al., Recommending Root-Cause and Mitigation Steps for Cloud Incidents using Large Language Models, ICSE 2023.
・Gupta, et al., Learning Representations on Logs for AIOps, CLOUD 2023.
Dynamic scaling frameworks that use SLOs composed from metrics
・Nastic, et al., SLOC: Service Level Objectives for Next Generation Cloud Computing, IEEE Internet Computing 24(3).
・Pusztai, et al., SLO Script: A Novel Language for Implementing Complex Cloud-Native Elasticity-Driven SLOs, ICWS, 2021.
・Pusztai, et al., A Novel Middleware for Efficiently Implementing Complex Cloud-Native SLOs, CLOUD, 2021.
・Nastic, et al., Polaris Scheduler: Edge Sensitive and SLO Aware Workload Scheduling in Cloud-Edge-IoT Clusters, CLOUD, 2021.
・OSS: https://github.com/polaris-slo-cloud/polaris-slo-framework.
-
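24 Basic policy for reading
Skim when you are searching for papers
・Read only the title, the abstract, and the figures and tables
・Judge quickly whether this is a paper you want to read
Read closely when you are considering implementing or applying the work
・Read while taking notes (see the following pages)
※ While you are not yet used to papers, it is better to read a few of them closely, for example by picking papers from the "examples of SRE papers" slides
Reference: "How to read papers, write papers, and spend your time in the lab - NAIST" (in Japanese), 2020 https://bit.ly/naist-how-to-research.
-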
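25 Typical structure of a computer science paper and the order to read it in
・Abstract: summary of the paper. What did this paper do?
・Introduction: positioning of the paper. Why is this paper important?
・Related Work: related research and problem setting. How does it differ from other work? What problems arise from that difference?
・Method: details of the proposal. How was the problem solved?
・Experiment: experiments, evaluation, and discussion. Is the problem really solved?
・Conclusion: conclusion. How far did the work get?
Reading-order markers on the slide, as extracted: ① ③ ② ④ ⑤ ⑥
Reference: Yoichi Ochiai, "Advanced technology and media expression #1 #FTMA15" (in Japanese), 2015 https://www.slideshare.net/Ochyai/1-ftma15.
-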
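26 How to read the Introduction
The Introduction is important because it lays out the whole picture of the paper.
① Societal background and problem awareness: general statements about a well-known problem
② The problem specific to the paper: the background closest to the paper's proposal
③ Existing methods and their shortcomings: the predecessors who have worked on ②
④ Overview of the proposal and of the evaluation: the solution to the shortcomings in ③
-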
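27 How to read the Introduction (example)
Reproduced from Hauer, et al., "Meaningful Availability", NSDI 2020, with parts ① to ④ marked.
Background and problem awareness
・Quantifying reliability
・An SLI should be meaningful, proportional, and actionable
・No existing metric satisfies these requirements
Shortcomings of existing methods
・Success ratio is biased toward active users, among others
Proposed method
・windowed user-uptime
Evaluation method
・Comparison against success ratio
・Uses G Suite data
-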
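The bias toward active users noted on this slide can be seen in a toy calculation. The sketch below contrasts the request-based success ratio with user-uptime, computed as total user up time divided by total user up plus down time. It is a simplified illustration: the `UserStats` type and all numbers are hypothetical, per-user up and down minutes are taken as given rather than derived from request logs, and the windowing step of windowed user-uptime is left out.

```python
from dataclasses import dataclass


@dataclass
class UserStats:
    """Hypothetical per-user counters for one hour of service."""
    ok_requests: int
    failed_requests: int
    up_minutes: float
    down_minutes: float


def success_ratio(users):
    """Successful requests divided by all requests, across all users."""
    ok = sum(u.ok_requests for u in users)
    total = sum(u.ok_requests + u.failed_requests for u in users)
    return ok / total


def user_uptime(users):
    """Total up time divided by total up plus down time, across all users."""
    up = sum(u.up_minutes for u in users)
    total = sum(u.up_minutes + u.down_minutes for u in users)
    return up / total


users = [
    # Heavy user: many requests, never affected by the outage.
    UserStats(ok_requests=9900, failed_requests=0, up_minutes=60, down_minutes=0),
    # Light user: few requests, unable to use the service for half the hour.
    UserStats(ok_requests=50, failed_requests=50, up_minutes=30, down_minutes=30),
]

print(f"success ratio: {success_ratio(users):.3f}")
print(f"user-uptime:   {user_uptime(users):.3f}")
```

With these made-up numbers the heavy user's traffic dominates the success ratio, while user-uptime weights each user by time and therefore reflects the light user's outage much more strongly.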
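28 What to look at when you intend to implement or apply the work
Q1. What problem does the paper address? Are you interested in that problem?
・Attention tends to go to the How
Q2. Are the experimental environment and conditions within the range you have in mind?
・The scale of the experimental environment may be too small or too large
Q3. Are the experimental results promising?
Q4. Is the proposal realistically implementable and applicable?
・The Discussion at the end of the paper sometimes addresses this
・Code and data are often not public
-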
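30 Reading records
It is hard to keep the papers you have read in your head
・Papers on not-yet-adopted technology in particular contain knowledge found nowhere else
・Understanding what makes a paper unique requires context specific to that paper
Keep paper-reading notes
・Make a map of papers: record the connections between papers and classify them by type
・Obsidian, a network-style note app, is recommended
・Bookmarking tools suit tracking what you have read, but not managing your own knowledge (presenter's subjective view)
Reference: joisino, "On my daily habit of reading papers" (in Japanese), 2023 https://joisino.hatenablog.com/entry/2023/04/10/170519.
-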
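A paper map of this kind can be bootstrapped with a tiny note generator. The sketch below renders a Markdown note that uses Obsidian-style `[[wikilinks]]` for connections between papers and a `#tag` for the category. The function `render_note`, the note layout, and the category name are hypothetical choices for illustration, not a format the talk prescribes.

```python
def render_note(title, venue, category, related, summary=""):
    """Render one paper-reading note as Markdown text (hypothetical layout).

    category: a tag without spaces, for example "trace-sampling"
    related:  titles of other notes; each becomes a [[wikilink]]
    """
    lines = [
        f"# {title}",
        "",
        f"Venue: {venue}",
        f"Category: #{category}",
        "",
        "## Summary",
        summary or "(to be filled in while reading)",
        "",
        "## Related papers",
    ]
    # One wikilink per related note, so the note app can draw the graph.
    lines += [f"- [[{name}]]" for name in related]
    return "\n".join(lines) + "\n"


note = render_note(
    title="Sieve",
    venue="ICWS 2021",
    category="trace-sampling",
    related=["Sifter"],
)
print(note)
```

Saving the returned text as `Sieve.md` in a notes folder would let the link to a `Sifter` note appear as an edge in the graph view, which is one way to grow the map as more papers are read.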
35 References
1. "How to read papers, write papers, and spend your time in the lab - NAIST" (in Japanese), 2020 https://bit.ly/naist-how-to-research.
2. Yoichi Ochiai, "Advanced technology and media expression #1 #FTMA15" (in Japanese), 2015 https://www.slideshare.net/Ochyai/1-ftma15.
3. Michio Honda, "How to read and find systems papers" (in Japanese), https://micchie.net/files/RG-HowToPaper.pdf.
4. joisino, "On my daily habit of reading papers" (in Japanese), 2023 https://joisino.hatenablog.com/entry/2023/04/10/170519.
5. S. Keshav, "How to Read a Paper", ACM SIGCOMM Computer Communication Review, 2007.