Multi-turn Conversations with MiniCPM-V: Exploring Single Image Understanding

by JamePeng2023 - opened Apr 14, 2024

Discussion

JamePeng2023

Apr 14, 2024

Hello openbmb team, can MiniCPM-V engage in multi-turn conversations about a single image?

finalf0

Apr 15, 2024

•

edited Apr 16, 2024

# First round chat 
msgs = [{"role": "user", "content": "Where should I go to buy a camera?"}]
res, context, _ = model.chat(
    image=image,
    msgs=msgs,
    tokenizer=tokenizer
)
print(res)

# Second round chat ,  append history context to msgs
msgs.append({"role": "assistant", "content": res})
msgs.append({"role": "user", "content": "Where is this store in the image?"})

res, context, _ = model.chat(
    image=image,
    msgs=msgs,
    tokenizer=tokenizer
)
print(res)

JamePeng2023

Apr 15, 2024

It's really a very interesting method. I get the last picture from the chat history and transfer it to the conversation with the big model.

finalf0

Apr 19, 2024

A bug fixed, msgs would be changed after calling model.chat(), please pull the latest file modeling_minicpmv.py

Upload images, audio, and videos by dragging in the text input, pasting, or clicking here.

Tap or paste here to upload images

· Sign up or log in to comment