CINXE.COM

KYD FAQs

<!DOCTYPE html> <html> <head> <meta name="viewport" content="width=device-width, initial-scale=1.0"> <!-- Do we need additional share metadata included here? --> <!-- Global site tag (gtag.js) - Google Analytics --> <script async src="https://www.googletagmanager.com/gtag/js?id=G-NX08RFMGJW" nonce="a7CPYalan1_Pz4Aq7jbGMQ"></script> <script nonce="a7CPYalan1_Pz4Aq7jbGMQ"> window.dataLayer = window.dataLayer || []; function gtag(){dataLayer.push(arguments);} gtag('js', new Date()); gtag('config', 'G-NX08RFMGJW'); </script> <title>KYD FAQs</title> <link rel="stylesheet" href="https://fonts.googleapis.com/icon?family=Material+Icons" nonce="a7CPYalan1_Pz4Aq7jbGMQ"> <link rel="stylesheet" href="/assets/css/material.min.css" nonce="a7CPYalan1_Pz4Aq7jbGMQ"> <link rel="stylesheet" href="/assets/css/material.min.css" nonce="a7CPYalan1_Pz4Aq7jbGMQ"> <link rel="stylesheet" href="/assets/css/prism-material-light.css" nonce="a7CPYalan1_Pz4Aq7jbGMQ"> <link href="https://fonts.googleapis.com/css?family=Open+Sans:400,400italic,500,500italic,700,700italic|Google Sans:400,400italic,500,500italic,700,700italic|Roboto" rel="stylesheet" nonce="a7CPYalan1_Pz4Aq7jbGMQ"> <link href="/assets/css/new.css" rel="stylesheet" nonce="a7CPYalan1_Pz4Aq7jbGMQ"> <link rel="icon" href="/assets/images/favicon.png" type="image/png"/> <meta name="keywords" content="machine learning,dataset,visualization"> <meta name="description" content="A tool to understand datasets with the goal of improving data quality, and helping mitigate fairness and bias issues."> <meta property="og:type" content="article"/> <meta property="og:title" content="Know Your Data"/> <meta property="og:url" content="https://knowyourdata.withgoogle.com"/> <meta property="og:image" content="https://knowyourdata.withgoogle.com/assets/images/meta-preview.png"/> <meta name="twitter:card" value="summary_large_image"> <meta name="twitter:title" content="Know Your Data"> <meta name="twitter:url" content="https://knowyourdata.withgoogle.com"> <meta name="twitter:image" content="https://knowyourdata.withgoogle.com/assets/images/meta-preview.png"> <meta name="author" content="The PAIR team, Google"> <meta charset="utf-8"> <style> .sub-title, .sub-copy { color: #000; } </style> </head> <body> <div class="mdl-layout mdl-layout--no-desktop-drawer-button mdl-js-layout mdl-layout--fixed-header"> <header class="mdl-layout__header"> <div class="mdl-layout__header-row"> <!-- Title --> <span class="mdl-layout__title"> <a href="/"> Know Your Data <sup>BETA</sup> </a> </span> <!-- Add spacer, to align navigation to the right --> <div class="mdl-layout-spacer"></div> <!-- Navigation. We hide it in small screens. --> <nav class="mdl-navigation mdl-layout--large-screen-only"> <a class="mdl-navigation__link" href="/docs/">DOCUMENTATION</a> <a class="mdl-navigation__link" href="/faqs/">FAQ</a> <a class="mdl-navigation__link" href="https://groups.google.com/g/knowyourdata-announce" target="-_blank">STAY UP TO DATE<img alt="stay up to date" class="header-arrow" src="/assets/images/arrow-link-out.svg"/></a> <a class="mdl-navigation__link" href="https://github.com/pair-code/knowyourdata" target="-_blank">GITHUB<img class="header-arrow" src="/assets/images/arrow-link-out.svg"/></a> </nav> </div> </header> <div class="mdl-layout__drawer"> <span class="mdl-layout__title"><a href="/">Know Your Data</a></span> <nav class="mdl-navigation"> <a class="mdl-navigation__link" href="/docs/">DOCUMENTATION</a> <a class="mdl-navigation__link" href="/faqs/">FAQ</a> <a class="mdl-navigation__link" href="https://groups.google.com/g/knowyourdata-announce" target="_blank">STAY UP TO DATE<img alt="stay up to date" class="header-arrow" src="/assets/images/arrow-link-out.svg"/></a> <a class="mdl-navigation__link" href="https://github.com/pair-code/knowyourdata" target="-_blank">GITHUB<img class="header-arrow" src="/assets/images/arrow-link-out.svg"/></a> </nav> </div> <main class="mdl-layout__content hero-banner"> <div class="sub-hero-container" style="background-image:url( /assets/images/LIT_FAQs_Banner.png ); height: 100px; "> <div class="hero-container mdl-grid"> <div class="mdl-cell--8-col mdl-cell--6-col-tablet mdl-cell--4-col-phone"> <div class="sub-title">Frequently Asked Questions</div> <div class="sub-copy"></div> </div> </div> </div> <div class="sub-page-container mdl-grid"> <div class="mdl-cell--12-col mdl-cell--8-col-tablet mdl-cell--4-col-phone"> <div class="mdl-cell--8-col mdl-cell--8-col-tablet mdl-cell--4-col-phone"> <div class="faq-element"> <h3 class="faq-title">How does KYD work?</h3> <div class="faq-copy"> <p>KYD allows users to explore the dataset by information that wasn鈥檛 originally in the dataset. The tool annotates the existing data using machine learning models like <a href="https://cloud.google.com/vision/docs/labels">Cloud Vision labels</a>, <a href="https://cloud.google.com/vision/docs/detecting-faces">Cloud Vision face detection</a>, and general <a href="https://en.wikipedia.org/wiki/Image_quality">image quality metrics</a> (e.g. sharpness and brightness). We compute these annotations in an <a href="https://beam.apache.org/">Apache Beam</a> pipeline. We then load the database in a web server to enable interactivity. The statistics are computed on the fly, each time you click.</p> </div> </div> <div class="faq-element"> <h3 class="faq-title">Can I run Know Your Data on my own data?</h3> <div class="faq-copy"> <p>Not yet. For now, we鈥檙e only serving Know Your Data for image-based datasets supported by the TensorFlow Datasets API.</p> </div> </div> <div class="faq-element"> <h3 class="faq-title">Will KYD work for other types of data?</h3> <div class="faq-copy"> <p>Yes. We are actively working on other modalities, including text datasets.</p> </div> </div> <div class="faq-element"> <h3 class="faq-title">What about model analysis?</h3> <div class="faq-copy"> <p>We are actively thinking about model analysis in KYD.</p> </div> </div> <div class="faq-element"> <h3 class="faq-title">How do I report an issue?</h3> <div class="faq-copy"> <p>For any bugs and feature requests related to the application, please <a href="https://github.com/pair-code/knowyourdata/issues/new">file a bug</a>. For any other questions, issues or concerns, email us at <a href="mailto:knowyourdata-feedback@google.com">knowyourdata-feedback@google.com</a>.</p> </div> </div> <div class="faq-element"> <h3 class="faq-title">Will KYD add more signals?</h3> <div class="faq-copy"> <p>KYD will continue to develop over time and we are very interested in making datasets more understandable by adding more signals. If you have a specific signal in mind, please <a href="mailto:knowyourdata-feedback@google.com">let us know</a>.</p> </div> </div> <div class="faq-element"> <h3 class="faq-title">Why doesn鈥檛 KYD use automated signals for protected attributes (for example, perceived gender expression, age or skin tone)?</h3> <div class="faq-copy"> <p>Although we are aware that signals on protected attributes can help tackle fairness issues in datasets, labeling individuals in datasets could lead to undesirable outcomes. We seek to avoid creating or reinforcing unfair bias and are looking into trusted responsible approaches to adding more signals before proceeding forward.</p> </div> </div> <div class="faq-element"> <h3 class="faq-title">Why are not all of the datasets supported by the TFDS API enabled in KYD?</h3> <div class="faq-copy"> <p>KYD features datasets with the appropriate licenses for us to serve them. We accept <a href="https://www.apache.org/licenses/LICENSE-2.0">Apache 2.0</a>, <a href="https://en.wikipedia.org/wiki/MIT_License">MIT</a> and <a href="https://creativecommons.org/">Creative Commons</a>. If you don鈥檛 see a TFDS supported dataset enabled in KYD, it鈥檚 likely that it may not have one of these licenses. If you are a dataset author interested in visualizing your dataset in KYD, please <a href="mailto:knowyourdata-feedback@google.com">let us know</a>.</p> </div> </div> <div class="faq-element"> <h3 class="faq-title">How can I learn more about the performance of KYD鈥檚 machine learning models?</h3> <div class="faq-copy"> <p>A key design principle of KYD is to always show the underlying data points to earn user trust. In addition, we offer details on KYD鈥檚 machine learning (ML) models. To understand their overall performance and limitations you can look at the publicly available <a href="https://modelcards.withgoogle.com/model-reports">model cards</a>.</p> </div> </div> </div> </div> <div class="fixed-sub-navigation hide-me"> <a href="https://github.com/pair-code/knowyourdata/issues/new" target="_blank">Ask a question</a> </div> </div> <div class="footer-container mdl-grid"> <div class="mdl-cell mdl-cell--2-col mdl-cell--2-col-tablet mdl-cell--4-col-phone"><a href="https://pair.withgoogle.com/" target="_blank"><img alt="Google PAIR logo" src="/assets/images/pair-logo.svg"/></a></div> <div class="mdl-cell mdl-cell--2-col mdl-cell--2-col-tablet mdl-cell--4-col-phone"><a href="https://github.com/pair-code" target="_blank">Github</a></div> <div class="footer-icons mdl-cell mdl-cell--4-col mdl-cell--8-col-tablet mdl-cell--4-col-phone"> <a href="mailto:peopleai@google.com" target="_blank"><img alt="Mail People AI at Google" src="/assets/images/mail.png"/></a> <a href="https://medium.com/people-ai-research" target="_blank"><img alt="PAIR Medium blog" src="/assets/images/medium.png"/></a> <a href="https://www.youtube.com/channel/UCnnns-uu4yy9BXfYSPIX5AA" target="_blank"><img alt="PAIR YouTube channel" src="/assets/images/youtube.png"/></a> </div> </div> </main> </div> </body> <script defer src="/assets/js/material.min.js" nonce="a7CPYalan1_Pz4Aq7jbGMQ"></script> <script defer src="https://unpkg.com/material-components-web@11.0.0/dist/material-components-web.min.js" nonce="a7CPYalan1_Pz4Aq7jbGMQ"></script> </html>

Pages: 1 2 3 4 5 6 7 8 9 10