
0. Introduction

In this post and future entries titled "Issues #", I'll highlight AI news that catches my interest. There's an abundance of information and hype in today's AI world, and I don't need to document all of it, so careful selection matters. These posts aren't meant to be the polished, comprehensive newsletters you'd find elsewhere. They're simply collections of items that intrigue me, along with some of my personal thoughts. It's a rough, unstructured approach: more of a casual exploration than a formal report.

1. Google

1-1 New in models

Google has added several new models to Google AI Studio, where you can try them out:
  • A new, smaller 8B variant: gemini-1.5-flash-8b-exp-0827
  • A new, more powerful Gemini 1.5 Pro model: gemini-1.5-pro-exp-0827
  • A new, enhanced flash model: gemini-1.5-flash-exp-0827
According to Logan Kilpatrick, the gemini-1.5-pro-exp-0827 model performs better on coding tasks and complex prompts. Here's the source:
I tested gemini-1.5-pro-exp-0827 myself and found that its output often repeats itself several times, which is not ideal in its current state. Below is my test output:
[Images: gemini-1.5-pro-exp-0827 test output]
I tested it both in Google AI Studio (left) and through the API (right), and the results were identical.
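To put a rough number on this kind of repetition instead of eyeballing it, a small helper can be run on any saved model output. This is only a sketch: the function name `repeated_ngram_ratio` and the 5-word window are illustrative assumptions, not anything provided by Google AI Studio or the Gemini API.

```python
from collections import Counter


def repeated_ngram_ratio(text: str, n: int = 5) -> float:
    """Return the fraction of word n-grams that duplicate an earlier n-gram.

    0.0 means no n-gram occurs twice; values close to 1.0 mean the text
    is mostly the same phrases looping.
    """
    words = text.split()
    if len(words) < n:
        return 0.0
    ngrams = [tuple(words[i:i + n]) for i in range(len(words) - n + 1)]
    counts = Counter(ngrams)
    # Every occurrence beyond the first counts as a repeat.
    repeats = sum(c - 1 for c in counts.values())
    return repeats / len(ngrams)


if __name__ == "__main__":
    # Hypothetical sample strings, not real model output.
    looping = "the model keeps saying the same thing over and over " * 4
    varied = "each word in this short sentence appears exactly once here"
    print(f"looping: {repeated_ngram_ratio(looping):.2f}")
    print(f"varied:  {repeated_ngram_ratio(varied):.2f}")
```

A high ratio on a long answer is a quick signal that the model got stuck in a loop. The threshold worth flagging depends on the prompt, since lists and code naturally repeat some phrases.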

1-2 New in Gemini App

Imagen 3 and Gems are rolling out in the Gemini App. Gemini Advanced subscribers will soon be able to create custom Gems. Moreover, all users will gain access to enhanced image capabilities powered by the latest Imagen 3 model.
Here's a demonstration:
[Images: Imagen 3 demonstration]
Personally, I prefer Midjourney and Flux over Imagen 3. Midjourney emphasizes aesthetics, while Flux, being open-source, lets me train a LoRA and explore more possibilities.

2. Zhipu AI

Zhipu AI also launched several new models this week at KDD:
  • New LLM: GLM-4-Plus
  • New image generation model: CogView-3-Plus
  • New image/video understanding model: GLM-4V-Plus
  • New video generation model: CogVideoX
Below are my test results for GLM-4-Plus:
[Images: GLM-4-Plus test results]
I asked three very tricky questions, and GLM-4-Plus answered all of them incorrectly, which fell short of my expectations.
Another update from Zhipu AI is that they've made glm-4-flash free. However, there may be some rate limits for free users. It's advisable to read their documentation carefully.
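Because the free tier may be rate limited, client code usually benefits from retrying with a growing delay. Below is a generic sketch of exponential backoff. `RateLimitError`, `call_with_backoff`, and the delay values are hypothetical placeholders, not part of Zhipu AI's SDK; the real error type and limits should come from their documentation.

```python
import time


class RateLimitError(Exception):
    """Hypothetical stand-in for whatever error the real client raises."""


def call_with_backoff(request, max_retries=5, base_delay=1.0, sleep=time.sleep):
    """Call request(); on RateLimitError wait base_delay * 2**attempt, then retry."""
    for attempt in range(max_retries):
        try:
            return request()
        except RateLimitError:
            if attempt == max_retries - 1:
                raise  # out of retries, surface the error to the caller
            sleep(base_delay * 2 ** attempt)


if __name__ == "__main__":
    calls = {"n": 0}

    def flaky_request():
        # Simulated API call that is rate limited on the first two attempts.
        calls["n"] += 1
        if calls["n"] < 3:
            raise RateLimitError("too many requests")
        return "ok"

    print(call_with_backoff(flaky_request, base_delay=0.01))
```

Passing `sleep` as a parameter keeps the helper easy to test without actually waiting.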

3. Qwen

The Qwen team announced the release of Qwen2-VL this week. They've open-sourced Qwen2-VL-2B and Qwen2-VL-7B under the Apache 2.0 license and made Qwen2-VL-72B available through an API. Here are some resources to learn more:
 

4. Runway

Runway has removed Stable Diffusion 1.5 from Hugging Face. You can verify this at the following link:

5. Claude System Prompt

 