NVIDIA: Llama 3.1 Nemotron 70B Instruct

131K

NVIDIA's Llama 3.1 Nemotron 70B is a language model designed for generating precise and useful responses. Leveraging [Llama 3.1 70B](/models/meta-llama/llama-3.1-70b-instruct) architecture and Reinforcement Learning from Human Feedback (RLHF), it excels in automatic alignment benchmarks. This model is tailored for applications requiring high accuracy in helpfulness and response generation, suitable for diverse user queries across multiple domains. Usage of this model is subject to [Meta's Acceptable Use Policy](https://www.llama.com/llama3/use-policy/).

OpenRouter 원가 (1M 토큰)

입력$1.20

출력$1.20

합계$2.40

인보이스랩 (월 예상)

입력 0.5M + 출력 0.5M 기준

공급가액₩1,976

부가세 (10%)₩198

결제 금액₩2,174

💡 부가세 ₩198 매입공제 가능

모델 정보

기본 정보

모델 ID	nvidia/llama-3.1-nemotron-70b-instruct
제공사	NVIDIA
컨텍스트 윈도우	131,072 토큰
모달리티	text->text

지원 기능

⚡ Function Calling📄 JSON Mode🌡️ Temperature📏 Max Tokens

API 사용법

Python (OpenAI SDK 호환)

from openai import OpenAI

client = OpenAI(
    api_key="your-dream-api-key",
    base_url="https://api.invoicedream.co.kr/v1"
)

response = client.chat.completions.create(
    model="nvidia/llama-3.1-nemotron-70b-instruct",
    messages=[
        {"role": "user", "content": "안녕하세요"}
    ]
)

print(response.choices[0].message.content)

Node.js / TypeScript

import OpenAI from 'openai';

const client = new OpenAI({
  apiKey: 'your-dream-api-key',
  baseURL: 'https://api.invoicedream.co.kr/v1'
});

const response = await client.chat.completions.create({
  model: 'nvidia/llama-3.1-nemotron-70b-instruct',
  messages: [{ role: 'user', content: '안녕하세요' }]
});

console.log(response.choices[0].message.content);

cURL

curl https://api.invoicedream.co.kr/v1/chat/completions \
  -H "Authorization: Bearer your-dream-api-key" \
  -H "Content-Type: application/json" \
  -d '{
    "model": "nvidia/llama-3.1-nemotron-70b-instruct",
    "messages": [{"role": "user", "content": "안녕하세요"}]
  }'

💡 Tip: OpenAI SDK를 그대로 사용할 수 있습니다.base_url만 변경하면 됩니다!