A Practical Guide To Dashboarding

A presentation at NDC Porto in February 2019 in Porto, Portugal by Jessica White

Slide 1

Slide 1

A Practical Guide to Dashboarding Jessica White Software Engineer

Slide 2

Slide 2

@JessPWhite #NDCPorto What Is A Dashboard & Why Should I Care?

Slide 3

Slide 3

@JessPWhite #NDCPorto

Slide 4

Slide 4

@JessPWhite #NDCPorto

Slide 5

Slide 5

@JessPWhite #NDCPorto

Slide 6

Slide 6

@JessPWhite #NDCPorto

Slide 7

Slide 7

@JessPWhite #NDCPorto Metrics VS Diagnostics ● High level view ● For assessing a system and spotting issues ● Kept in present view ● Log analytics ● For tracing and debugging ● Used when fixing issues

Slide 8

Slide 8

@JessPWhite #NDCPorto Suggested Baselines

Slide 9

Slide 9

@JessPWhite #NDCPorto Services

Slide 10

Slide 10

@JessPWhite #NDCPorto U.S.E

Slide 11

Slide 11

@JessPWhite #NDCPorto Utilisation

Slide 12

Slide 12

@JessPWhite #NDCPorto Saturation

Slide 13

Slide 13

@JessPWhite #NDCPorto Error

Slide 14

Slide 14

@JessPWhite #NDCPorto APIs

Slide 15

Slide 15

@JessPWhite #NDCPorto Four Golden Signals - Google Traffic Error Latency Saturation

Slide 16

Slide 16

@JessPWhite #NDCPorto Four Golden Signals - Google Traffic

Slide 17

Slide 17

@JessPWhite #NDCPorto Four Golden Signals - Google Error

Slide 18

Slide 18

@JessPWhite #NDCPorto Four Golden Signals - Google Latency

Slide 19

Slide 19

@JessPWhite #NDCPorto Four Golden Signals - Google Saturation

Slide 20

Slide 20

@JessPWhite #NDCPorto Four Golden Signals - Google Saturation

Slide 21

Slide 21

@JessPWhite #NDCPorto Four Golden Signals - Google Saturation

Slide 22

Slide 22

@JessPWhite #NDCPorto Four Golden Signals - Google Saturation

Slide 23

Slide 23

@JessPWhite #NDCPorto R.E.D Rate Error Duration

Slide 24

Slide 24

@JessPWhite #NDCPorto R.E.D Rate

Slide 25

Slide 25

@JessPWhite #NDCPorto R.E.D Error

Slide 26

Slide 26

@JessPWhite #NDCPorto R.E.D Duration

Slide 27

Slide 27

@JessPWhite #NDCPorto Rate Error Duration Traffic Error Latency Saturation

Slide 28

Slide 28

@JessPWhite #NDCPorto Response API Includes calls to the server

Slide 29

Slide 29

@JessPWhite #NDCPorto

Slide 30

Slide 30

@JessPWhite #NDCPorto Response API Includes calls to the server

Slide 31

Slide 31

@JessPWhite #NDCPorto Response API REQUEST Includes calls to the server RESPONSE SERVER

Slide 32

Slide 32

@JessPWhite #NDCPorto WHISPER CARBON SERVER

Slide 33

Slide 33

@JessPWhite #NDCPorto WHISPER CARBON SERVER

Slide 34

Slide 34

Demo

Slide 35

Slide 35

@JessPWhite #NDCPorto Task definition includes: ● Docker image ● Port mappings ● Environment variables ● Postgres endpoint ● Graphite address

Slide 36

Slide 36

@JessPWhite #NDCPorto Task definition includes: ● Docker image ● Port mappings ● Environment variables ● Postgres endpoint ● Graphite address Postgres Whisper Config

Slide 37

Slide 37

@JessPWhite #NDCPorto Do these metrics help you in real life?

Slide 38

Slide 38

@JessPWhite #NDCPorto

Slide 39

Slide 39

@JessPWhite #NDCPorto

Slide 40

Slide 40

@JessPWhite #NDCPorto Where Can I Learn More?

Slide 41

Slide 41

@JessPWhite #NDCPorto These slides and resources will be made available on my be.notist page after this session: https://noti.st/jesspwhite The link to which will be shared on Twitter @JessPWhite

Slide 42

Slide 42

@JessPWhite #NDCPorto Tom Wilkie -The Red Method: How To Instrument Your Services ● Talk from Influx Days ● Slides From GrafanaCon EU Brendan Greg -The USE Method ● Article on USE Method Google SRE Book ● Book available online ● Other google SRE resources Extra ● I also occasionally blog

Slide 43

Slide 43

@JessPWhite #NDCPorto

Slide 44

Slide 44

@JessPWhite #NDCPorto

Slide 45

Slide 45

@JessPWhite #NDCPorto

Slide 46

Slide 46

@JessPWhite #NDCPorto Thank you for listening