Jubayer Ibn Hamid

<!DOCTYPE HTML> <html lang="en"><head><meta http-equiv="Content-Type" content="text/html; charset=UTF-8"> <title>Jubayer Ibn Hamid</title> <meta name="author" content="Jubayer Ibn Hamid"> <meta name="viewport" content="width=device-width, initial-scale=1"> <link rel="stylesheet" type="text/css" href="stylesheet.css">  <link rel="icon" href="./images/cayleygraph.png">  </head> <body> <table style="width:100%;max-width:800px;border:0px;border-spacing:0px;border-collapse:separate;margin-right:auto;margin-left:auto;"><tbody> <tr style="padding:0px"> <td style="padding:0px"> <table style="width:100%;border:0px;border-spacing:0px;border-collapse:separate;margin-right:auto;margin-left:auto;"><tbody> <tr style="padding:0px"> <td style="padding:2.5%;width:63%;vertical-align:middle"> <p style="text-align:center"> <heading style="display:block; font-size:3em; margin-bottom: 10px;">Jubayer Ibn Hamid</heading> </p> <p> I am an incoming Ph.D. student, currently doing research at Stanford, where I am advised by <a href="https://ai.stanford.edu/~cbfinn/">Chelsea Finn</a> and <a href="https://dorsa.fyi">Dorsa Sadigh</a>. I studied Mathematical Physics (B.S.) and Computer Science (M.S.), both at Stanford University. </p> <p> I work in artificial intelligence with a focus on the intersection of reinforcement learning, generative models and representation learning. I am also interested in pure mathematics, including abstract algebra, category theory and algebraic geometry. </p>
<p style="text-align: center"> <a href="data/CV.pdf">CV</a> &nbsp;/&nbsp; <a href="https://scholar.google.co.uk/citations?user=3G2EbP4AAAAJ&hl=en">Scholar</a> &nbsp;/&nbsp;  <a href="mailto:jubayer@stanford.edu">Email</a> &nbsp;/&nbsp; <a href="https://twitter.com/jubayer_hamid">Twitter</a> &nbsp; &nbsp; </p> </td> <td style="padding:2.5%;width:20%;max-width:20%"> <a href="./images/cohojubayer.jpg"><img style="width:100%;max-width:100%" alt="profile photo" src="./images/cohojubayer.jpg" class="hoverZoomLink"></a> </td> </tr> </tbody></table> <table style="width:100%;border:0px;border-spacing:0px;border-collapse:separate;margin-right:auto;margin-left:auto;"> <tbody> <tr> <td style="padding:20px;width:100%;vertical-align:middle"> <heading style="display:block; font-size:2em; margin-bottom: 10px;">Research</heading> <p> I am currently interested in sample-efficient and deep exploration methods for online reinforcement learning fine-tuning, spanning both language models and embodied agents. My research also focuses on test-time decoding from complex, multimodal policies and training robotic policies with long context. Additionally, I am interested in understanding what kinds of data to scale for improved generalization in robot learning. </p> <p style="margin-bottom: 0;"> (*) denotes co-first authorship </p> <ul style="margin-top: 0;"> <li> <p> <papertitle><a href="https://arxiv.org/abs/2408.17355">Bidirectional Decoding: Improving Action Chunking via Guided Test-Time Sampling</a></papertitle>. Yuejiang Liu*, Jubayer Ibn Hamid*, Annie Xie, Yoonho Lee, Max Du, Chelsea Finn. International Conference on Learning Representations (ICLR), 2025. (<a href="https://bid-robot.github.io/">Website</a>) </p> </li> <li> <p> <papertitle><a href="https://arxiv.org/abs/2404.10282">Tripod: Three Complementary Inductive Biases for Disentangled Representation Learning</a></papertitle>. Kyle Hsu*, Jubayer Ibn Hamid*, Kaylee Burns, Chelsea Finn, Jiajun Wu. 
International Conference on Machine Learning (ICML), 2024. </p> </li> <li> <p> <papertitle><a href="https://arxiv.org/abs/2312.12444">What Makes Pre-trained Visual Representations Successful For Robust Manipulation?</a></papertitle> Kaylee Burns, Zach Witzel, Jubayer Ibn Hamid, Tianhe Yu, Chelsea Finn, Karol Hausman. Conference on Robot Learning (CoRL), 2024. </p> </li> </ul> </td> </tr> </tbody> </table>  <table style="width:100%; border:0px; border-spacing:0px; border-collapse:separate; margin-right:auto; margin-left:auto;"> <tbody> <tr> <td style="padding:20px; width:100%; vertical-align:middle;"> <heading style="display:block; font-size:2em; margin-bottom:10px;">Notes</heading> <p> Here are some introductory notes on various topics that have fascinated me. These are not meant to be in-depth. Rather, they are meant to cover some of the basic constructions that show up periodically and are also interesting in and of themselves. </p> <ul style="margin-top:0;"> <li> <p> <a href="data/Algebraic_Geometry_Notes.pdf">Category Theory and Algebraic Geometry</a>. <i>Notes on introductory category theory and foundational constructions in algebraic geometry (work in progress).</i> </p> </li> <li> <p> <a href="data/Algebraic topology notes.pdf">Algebraic Topology</a>. <i>Foundational constructions: fundamental group, homology and cohomology (incomplete, will typeset later).</i> </p> </li> <li> <p> <a href="data/Whitney_s_Theorems.pdf">Whitney's Embedding Theorem and Immersion Theorem</a>. <i>Weak versions of Whitney's embedding and immersion theorems, which are often sufficient.</i> </p> </li> <li> <p> <a href="data/RL_Notes__final_.pdf">Policy Gradient Methods</a>. <i> Building blocks of policy gradient methods in online reinforcement learning.</i> </p> </li> <li> <p> <a href="data/Trust_Region_Methods.pdf">Trust Region Optimization Methods</a>. 
<i> Introduction to trust region methods for optimization.</i> </p> </li> </ul> </td> </tr> </tbody> </table> <table style="width:100%;border:0;border-spacing:0;border-collapse:separate;margin:auto;"> <tbody> <tr> <td style="padding:20px;width:100%;vertical-align:middle;"> <h2 style="font-size:2em; margin-bottom:10px;">Talks</h2> <p> Bidirectional Decoding.<i> OpenAI. 25th February, 2025.</i> </p> </td> </tr> </tbody> </table> <table style="width:100%;border:0;border-spacing:0;border-collapse:separate;margin:auto;"> <tbody> <tr> <td style="padding:20px;width:100%;vertical-align:middle;"> <h2 style="font-size:2em; margin-bottom:10px;">Teaching</h2> <p> <a href="https://cs224r.stanford.edu" target="_blank" rel="noopener noreferrer"> CS 224R - Deep Reinforcement Learning </a> <i>: Head CA. Spring, 2025.</i> </p> <p> <a href="https://cs229.stanford.edu" target="_blank" rel="noopener noreferrer"> CS 229 - Machine Learning </a> <i>: CA. Winter, 2025.</i> </p> </td> </tr> </tbody> </table> <table style="width:100%;border:0px;border-spacing:0px;border-collapse:separate;margin-right:auto;margin-left:auto;"><tbody> <tr> <td style="padding:0px"> <br> <p style="text-align:right;"> <a href="https://jonbarron.info/">Template</a> </p> </td> </tr> </tbody></table> </td> </tr> </tbody> </table> </body> </html>
