Abstract
This month has been emotionally intense, marked by a series of intriguing and unfortunate events. Many instances sparked curiosity and inspiration, while others sadly brought about sorrow and anger. It's truly been a month full of diverse experiences.
Work & Learning
This month, I used the CrewAI agent framework to assemble a team of agents to write a script for me. I found CrewAI user-friendly: all I had to do was write the task descriptions and assign roles to the team. However, the results of my tests fell short of my expectations, even though I used the top-rated model, Claude. I think the reasons are as follows:
- The task descriptions still need improvement. Writing a script is challenging, so clear and easy-to-understand descriptions are essential. For example, I should describe precisely what each episode should cover and tell the agent team how many episodes the task requires.
- Although advanced large language models (LLMs) such as GPT-4 and Claude are highly sophisticated, they are still not well suited to tasks that require both logic and creativity, such as writing novels and scripts. Many people consider writing a purely creative task, but it also demands logical coherence. LLMs tend to lose track of earlier details in long narratives. For instance, while writing the fifth episode, an LLM might forget a character introduced in the first episode.
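Both points can be made concrete. The sketch below is a minimal illustration in plain Python, not CrewAI's own API: it builds one task description per episode, states the total number of episodes in each one, and restates every character introduced so far so that later prompts do not depend on the model remembering earlier episodes. The names `EpisodePlan` and `build_task_descriptions` and the sample plot are hypothetical.

```python
from dataclasses import dataclass, field


@dataclass
class EpisodePlan:
    """Hypothetical container describing what one episode should cover."""
    number: int
    synopsis: str
    new_characters: list[str] = field(default_factory=list)


def build_task_descriptions(plans: list[EpisodePlan]) -> list[str]:
    """Return one explicit task description per episode."""
    total = len(plans)
    known_characters: list[str] = []  # grows as episodes introduce characters
    descriptions = []
    for plan in plans:
        known_characters.extend(plan.new_characters)
        descriptions.append(
            f"Write episode {plan.number} of {total}. "
            f"Synopsis: {plan.synopsis} "
            f"Characters introduced so far: {', '.join(known_characters)}."
        )
    return descriptions


# Sample plot, invented purely for illustration.
plans = [
    EpisodePlan(1, "Mia finds a locked diary.", ["Mia"]),
    EpisodePlan(2, "Mia asks Leo to help open it.", ["Leo"]),
]
for text in build_task_descriptions(plans):
    print(text)
```

Each resulting string could then serve as the description of one task handed to the agent team.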
I also ran some very interesting experiments this month and wrote them up on the blog.
- The Alice in Wonderland test shows that even advanced large language models (LLMs) struggle with simple questions.
- TextGrad is an automated prompt optimization framework designed to help models solve complex and tricky problems. If you're familiar with PyTorch, you'll find it easy to use. It's a powerful tool.
- I also tested the newly released Claude 3.5 Sonnet model this month. As a fan of Claude, I must say it feels noticeably smarter.
- I've also read some compelling papers that provided insightful information. I will summarize them in separate posts.
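For reference, the question behind the Alice in Wonderland test has the form "Alice has N brothers and she also has M sisters. How many sisters does Alice's brother have?" The short sketch below computes the expected answer, which is M + 1 because Alice herself is also a sister of each brother. The function name is chosen here only for illustration.

```python
def sisters_of_alices_brother(brothers: int, sisters: int) -> int:
    """Expected answer to the Alice in Wonderland question."""
    # Each brother has all of Alice's sisters plus Alice herself.
    # The number of brothers does not affect the answer.
    return sisters + 1


print(sisters_of_alices_brother(brothers=3, sisters=6))  # prints 7
```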
Life
This month, there weren't any significant events in my life, except for my girlfriend's unjust dismissal from her job. I've been providing her with emotional support, which has been both challenging and rewarding. It has served as a reminder of the importance of advocating for justice, even in the face of adversity. She has already accused her company of violating labor laws. I hope she wins this case!
- Author: Chengsheng Deng
- URL: https://chengshengddeng.com/article/recap-for-june
- Copyright: Unless otherwise stated, all articles in this blog are licensed under the BY-NC-SA agreement. Please credit the source!