CINXE.COM
Jubayer Ibn Hamid
<!DOCTYPE HTML> <html lang="en"><head><meta http-equiv="Content-Type" content="text/html; charset=UTF-8"> <title>Jubayer Ibn Hamid</title> <meta name="author" content="Jubayer Ibn Hamid"> <meta name="viewport" content="width=device-width, initial-scale=1"> <link rel="stylesheet" type="text/css" href="stylesheet.css"> <!-- <link rel="icon" href="data:image/svg+xml,<svg xmlns=%22http://www.w3.org/2000/svg%22 viewBox=%220 0 100 100%22><text y=%22.9em%22 font-size=%2290%22>🌐</text></svg>"> --> <link rel="icon" href="./images/cayleygraph.png"> <!-- <link rel="icon" href="data:image/svg+xml,<svg xmlns=%22http://www.w3.org/2000/svg%22 viewBox=%220 0 100 100%22><text y=%22.9em%22 font-size=%2290%22>🌐</text></svg>"> --> </head> <body> <table style="width:100%;max-width:800px;border:0px;border-spacing:0px;border-collapse:separate;margin-right:auto;margin-left:auto;"><tbody> <tr style="padding:0px"> <td style="padding:0px"> <table style="width:100%;border:0px;border-spacing:0px;border-collapse:separate;margin-right:auto;margin-left:auto;"><tbody> <tr style="padding:0px"> <td style="padding:2.5%;width:63%;vertical-align:middle"> <p style="text-align:center"> <heading style="display:block; font-size:3em; margin-bottom: 10px;">Jubayer Ibn Hamid</heading> </p> <p> I am an incoming Ph.D. student, currently doing research at Stanford where I am advised by <a href="https://ai.stanford.edu/~cbfinn/">Chelsea Finn</a> and <a href="https://dorsa.fyi">Dorsa Sadigh</a>. I studied Mathematical Physics (B.S) and Computer Science (M.S), both at Stanford University. <p style="text-align:center"> <p> I work in artificial intelligence with a focus on the intersection of reinforcement learning, generative models and representation learning. I am also interested in pure mathematics such as abstract algebra, category theory and algebraic geometry. <p style="text-align: center"> <a href="data/CV.pdf">CV</a>  /  <a href = "https://scholar.google.co.uk/citations?user=3G2EbP4AAAAJ&hl=en">Scholar</a>  /  <!-- <a href="https://github.com/Jubayer-Hamid">Github</a>  /  --> <a href="mailto:jubayer@stanford.edu">Email</a>  /  <a href="https://twitter.com/jubayer_hamid">Twitter</a>     </p> </td> <td style="padding:2.5%;width:20%;max-width:20%"> <a href="./images/cohojubayer.jpg"><img style="width:100%;max-width:100%" alt="profile photo" src="./images/cohojubayer.jpg" class="hoverZoomLink"></a> </td> </tr> </tbody></table> <table style="width:100%;border:0px;border-spacing:0px;border-collapse:separate;margin-right:auto;margin-left:auto;"> <tbody> <tr> <td style="padding:20px;width:100%;vertical-align:middle"> <heading style="display:block; font-size:2em; margin-bottom: 10px;">Research</heading> <p> I am currently interested in sample-efficient and deep exploration methods for online reinforcement learning fine-tuning, spanning both language models and embodied agents. My research also focuses on test-time decoding from complex, multimodal policies and training robotic policies with long context. Additionally, I am interested in understanding what kinds of data to scale for improved generalization in robot learning. </p> <p style="margin-bottom: 0;"> (*) denotes co-first authorship </p> <ul style="margin-top: 0;"> <li> <p> <papertitle><a href="https://arxiv.org/abs/2408.17355">Bidirectional Decoding: Improving Action Chunking via Guided Test-Time Sampling</a></papertitle>. Yuejiang Liu*, Jubayer Ibn Hamid*, Annie Xie, Yoonho Lee, Max Du, Chelsea Finn. International Conference on Learning Representations (ICLR), 2025. <a href="https://bid-robot.github.io/">(Website</a>)</a> </p> </li> <li> <p> <papertitle><a href="https://arxiv.org/abs/2404.10282">Tripod: Three Complementary Inductive Biases for Disentangled Representation Learning</a></papertitle>. Kyle Hsu*, Jubayer Ibn Hamid*, Kaylee Burns, Chelsea Finn, Jiajun Wu. International Conference on Machine Learning (ICML), 2024.</a> </p> </li> <li> <p> <papertitle><a href="https://arxiv.org/abs/2312.12444">What Makes Pre-trained Visual Representations Successful For Robust Manipulation?</a></papertitle> Kaylee Burns, Zach Witzel, Jubayer Ibn Hamid, Tianhe Yu, Chelsea Finn, Karol Hausman. Conference on Robot Learning (CoRL), 2024.</a> </p> </li> </ul> </td> </tr> </tbody> </table> <!-- <table style="width:100%;border:0px;border-spacing:0px;border-collapse:separate;margin-right:auto;margin-left:auto;"><tbody> <tr> <td style="padding:20px;width:100%;vertical-align:middle"> <heading style="display:block; font-size:2em; margin-bottom: 10px;">Notes</heading> <p> I am sharing notes on various topics that have fascinated me. These are not meant to be in-depth. Rather, they are meant to cover some of the basic constructions that show up periodically and are also interesting in and of themselves. </p> <p> <a href="data/Algebraic_Geometry_Notes.pdf">Algebraic Geometry</a>. <i>Foundational constructions and results in algebraic geometry (Incomplete).</i> </p> <p> <a href="data/Algebraic topology notes.pdf">Algebraic Topology</a>. <i>Foundational constructions - fundamental group, homology and cohomology. (Incomplete, will typeset later). </i> </p> <p> <a href="data/Whitney_s_Theorems.pdf">Whitney's Embedding Theorem and Immersion Theorem</a>. <i>Weak versions of Whitney's Embedding and Immersion theorem, which are often sufficient. </i> </p> <p> <a href="data/Policy_Gradient_Methods.pdf">Policy Gradient Methods</a>. <i>Building blocks (including the policy gradient theorems for both episodic and continuing tasks) of policy gradient algorithms.</i> </p> </td> </tr> </tbody></table> --> <table style="width:100%; border:0px; border-spacing:0px; border-collapse:separate; margin-right:auto; margin-left:auto;"> <tbody> <tr> <td style="padding:20px; width:100%; vertical-align:middle;"> <heading style="display:block; font-size:2em; margin-bottom:10px;">Notes</heading> <p> Here are some introductory notes on various topics that have fascinated me. These are not meant to be in-depth. Rather, they are meant to cover some of the basic constructions that show up periodically and are also interesting in and of themselves. </p> <ul style="margin-top:0;"> <li> <p> <a href="data/Algebraic_Geometry_Notes.pdf">Category Theory and Algebraic Geometry</a>. <i>Notes on introductory category theory and foundational constructions in algebraic geometry (work in progress).</i> </p> </li> <li> <p> <a href="data/Algebraic topology notes.pdf">Algebraic Topology</a>. <i>Foundational constructions – fundamental group, homology and cohomology. (Incomplete, will typeset later).</i> </p> </li> <li> <p> <a href="data/Whitney_s_Theorems.pdf">Whitney's Embedding Theorem and Immersion Theorem</a>. <i>Weak versions of Whitney's Embedding and Immersion theorem, which are often sufficient.</i> </p> </li> <li> <p> <a href="data/RL_Notes__final_.pdf">Policy Gradient Methods</a>. <i> Building blocks of policy gradient methods in online reinforcement learning.</i> </p> </li> <li> <p> <a href="data/Trust_Region_Methods.pdf">Trust Region Optimization Methods</a>. <i> Introduction to trust region methods for optimization.</i> </p> </li> </ul> </td> </tr> </tbody> </table> <table style="width:100%;border:0;border-spacing:0;border-collapse:separate;margin:auto;"> <tbody> <tr> <td style="padding:20px;width:100%;vertical-align:middle;"> <h2 style="font-size:2em; margin-bottom:10px;">Talks</h2> <p> Bidirectional Decoding.<i> OpenAI. 25th February, 2025.</i> </p> </td> </tr> </tbody> </table> <table style="width:100%;border:0;border-spacing:0;border-collapse:separate;margin:auto;"> <tbody> <tr> <td style="padding:20px;width:100%;vertical-align:middle;"> <h2 style="font-size:2em; margin-bottom:10px;">Teaching</h2> <p> <a href="https://cs224r.stanford.edu" target="_blank" rel="noopener noreferrer"> CS 224R - Deep Reinforcement Learning </a> <i>: Head CA. Spring, 2025.</i> </p> <p> <a href="https://cs229.stanford.edu" target="_blank" rel="noopener noreferrer"> CS 229 - Machine Learning </a> <i>: CA. Winter, 2025.</i> </p> </td> </tr> </tbody> </table> </tbody></table> <table style="width:100%;border:0px;border-spacing:0px;border-collapse:separate;margin-right:auto;margin-left:auto;"><tbody> <tr> <td style="padding:0px"> <br> <p style="text-align:right;"> <a href="https://jonbarron.info/">Template</a> </p> </td> </tr> </tbody></table> </td> </tr> </table> </body> </html>