Mukund Murali's blog

Exploring Your Cloud with Steampipe and Powerpipe: An Overview with OCI Focus

Mukund Murali

Introduction

This blog post explores Steampipe and Powerpipe, powerful tools for gaining insights into your cloud resources, particularly for Oracle Cloud Infrastructure (OCI). We'll cover the basics of Steampipe, Powerpipe, and their parent company Turbot, the supported cloud providers, and delve specifically into how these tools can be used with OCI and Terraform.

What is Steampipe?

Steampipe is an open-source tool that enables users to query cloud APIs, code repositories, and other services using SQL. It acts as a universal translator, eliminating the need for specialized knowledge of various platforms and allowing you to leverage your existing SQL skills to interrogate your infrastructure. Steampipe essentially turns your cloud resources into a database that can be queried, analysed, and reported on.

What is Powerpipe?

Powerpipe builds upon Steampipe's functionality by providing a visual interface for creating dashboards and reports. It takes the raw data gathered by Steampipe and transforms it into easily understandable visualisations, making it easier to identify trends, potential security risks, and cost optimization opportunities. Powerpipe is particularly beneficial for DevOps, SecOps, and FinOps teams as it provides a consolidated view of cloud resources and their configurations.

What is Turbot?

Steampipe and Powerpipe are products of Turbot, a company that provides cloud governance and automation solutions. Turbot offers both open-source and commercial products based on Steampipe and Powerpipe technology.

Cloud Providers Supported by Steampipe

Steampipe has a wide range of plugins that connect to various cloud providers and services. Some of the major supported cloud providers include:

Alibaba Cloud
Amazon Web Services (AWS)
Azure
DigitalOcean
Google Cloud Platform (GCP)
Heroku
IBM Cloud
Linode
Oracle Cloud Infrastructure (OCI)
Scaleway
Snowflake

Focusing on OCI: Steampipe, Powerpipe, and Terraform

For Oracle Cloud Infrastructure (OCI), Steampipe provides comprehensive coverage through its plugin. This plugin allows users to query various OCI services and resources using SQL. In addition to basic querying, Steampipe for OCI can be used for:

Compliance Checks: The oci_compliance mod allows you to run security and compliance checks against your OCI resources. It supports benchmarks like CIS, helping you identify misconfigurations and potential vulnerabilities.
Insights and Reporting: The oci_insights mod provides pre-built dashboards for visualizing your OCI resources. This helps gain a better understanding of your cloud infrastructure and its configuration.
Cost Optimization: The oci_thrifty mod focuses on finding unused and underutilized resources. This helps optimize your cloud spending and reduce costs.
Terraform is a popular Infrastructure as Code (IaC) tool used to manage and provision cloud resources. The terraform_oci_compliance mod allows you to scan your Terraform code, plans, and state files for potential security misconfigurations before deploying to OCI.

Examples and Use Cases for OCI

Let's illustrate the power of Steampipe with some practical examples for OCI:

Querying OCI Users: You can retrieve details about your OCI users with a simple SQL query like:
```
  select name, id, is_mfa_activated from oci_identity_user;
```
This query retrieves the name, ID, and MFA activation status of all OCI users.
Checking for Publicly Accessible Buckets: To enhance security, you can identify object storage buckets that are publicly accessible using a query like:
```
  select name from oci_objectstorage_bucket where public_access_type != 'NoPublicAccess';
```
This query lists the names of all buckets that allow some form of public access.
Identifying Unused Compute Instances: For cost optimization, you can find compute instances that are potentially unused with a query like:
```
  select display_name, shape, time_created
  from oci_core_instance
  where state = 'Stopped'
  order by time_created desc;
```
This query lists stopped instances, ordered by their creation time, allowing you to investigate instances that have been stopped for a long time.

OCI Insights Dashboard Questions

While the sources don't explicitly list the questions that OCI Insights dashboards answer, they do provide clues about their capabilities. Based on the information in the sources, here's a breakdown of potential questions these dashboards can help answer:

Resource Inventory and Properties:
- How many resources do I have? This basic question can be answered across various resource types, providing a quick overview of your OCI footprint.
- How old are my resources? Understanding the age of resources can be helpful for lifecycle management and identifying potential candidates for decommissioning or upgrades.

Security Posture:

Are there any publicly accessible resources? This is a critical security question, as publicly accessible resources can pose significant risks. OCI Insights dashboards can likely help identify such resources, allowing you to take corrective actions.
Is encryption enabled, and what keys are used for encryption? These dashboards can likely provide insights into the encryption status of resources, helping you assess your data protection measures.

Data Management:

Is versioning enabled? This is particularly relevant for services like object storage, where versioning provides data recovery and protection against accidental deletions. OCI Insights dashboards can potentially show the versioning status of your resources.

Other Potential Insights:

The sources mention that OCI Insights dashboards are available for 10+ services, including Block Storage, Compute, Identity, Object Storage, VCN, and more. Given this breadth of coverage, it's likely that these dashboards can also help answer questions related to:

Resource utilization and performance: Dashboards could potentially show metrics related to CPU utilization, storage usage, network traffic, and other performance indicators.
Cost allocation and trends: Insights into resource usage can be further used to analyse cost allocation and identify potential areas for optimisation.
Compliance with specific standards: While the sources primarily focus on CIS benchmarks, it's possible that OCI Insights dashboards could also address compliance requirements for other standards, depending on the available mods and configurations.

Conclusion and Next Steps

Steampipe and Powerpipe are valuable tools for anyone working with cloud infrastructure, especially OCI. They provide a unified and intuitive way to query, analyZe, and visualize your cloud resources using the familiar SQL language. By leveraging these tools, you can gain a deeper understanding of your cloud infrastructure, improve security and compliance, and optimize costs.

To learn more about Steampipe and Powerpipe:

Visit the Steampipe Hub: Explore the various Steampipe plugins and their documentation.
Visit the Powerpipe Hub: Discover the available Powerpipe mods and see how they can be used to create insightful dashboards.
Explore Turbot's Website: Find information about Turbot's products and solutions built on Steampipe and Powerpipe technology.

Google's NotebookLM: A Second Brain for the AI Age

Mukund Murali

What is NotebookLM? NotebookLM is an AI-powered research tool developed by Google, designed to function as a personalized knowledge base powered by AI.

Features and Capabilities:

Source Grounded AI: Unlike tools like Perplexity that rely on searching the open web, NotebookLM works by uploading your own documents, limiting its knowledge to the information you provide. This eliminates the risk of hallucinations, ensuring that the AI only generates responses grounded in your sources.
Multimodal Support: NotebookLM can handle a variety of file formats, including Google Docs, Google Slides, PDFs, images, and even YouTube videos and audio files. This versatility makes it ideal for research projects involving diverse types of information.
Extensive Context Window: It supports up to 50 sources with up to 200,000 words per file, allowing you to build a knowledge base of up to 4 million words. This is much larger than the context windows of many other AI tools, enabling more comprehensive analysis.
Audio Overview Generation: One of the standout features is its ability to summarize your sources into a podcast-style discussion between two AI voices. This can provide an engaging and efficient way to consume large amounts of information, though it's important to note that this feature is more prone to hallucinations.
Precise Citations: NotebookLM cites its sources with pinpoint accuracy, highlighting the specific text it's drawing from. This is particularly useful for academic research or any situation where verifying information is crucial.

Use Cases with Examples:

Keeping Up with Niche Information: Upload weekly transcriptions of YouTube videos or podcasts in your industry to get a quick summary and analysis.
Client Relationship Management: Store transcripts of client calls to quickly recall past discussions, identify recurring themes, and gain insights into the relationship.
Market and Competitive Research: Analyze competitor websites and marketing materials to uncover their messaging, target audience, and potential gaps in their approach.
Content Research:
- YouTube Content Research: Analyze top-performing YouTube videos on a topic to identify common themes, effective hooks, and unique angles.
- Content Optimization: Analyze the top-ranking articles for a target keyword to identify main topics covered, semantic keywords, and opportunities to make your content more comprehensive and relevant.
Turning a Blog Article into a Podcast: Upload a blog article, generate an audio overview, and edit the transcript to create a podcast script in your own voice.
Audience Research: Analyze LinkedIn comments on posts by influential figures in your niche to understand audience desires, pain points, and potential content ideas.
Academic Paper Summarization: Upload a complex academic paper and ask the AI to summarize key findings, explain concepts in simpler terms, or extract specific information.
Meeting Notes Analysis: Upload meeting transcripts to quickly extract key points, identify action items, and generate summaries or proposals.
Writing Assistance:
- Idea Generation: Use NotebookLM to brainstorm related ideas or avenues of research based on your existing notes.
- Outlining and Structuring: Generate outlines, key points, and suggestions for improving your writing.
- Drafting and Rewriting: Ask the AI to rewrite your drafts based on specific instructions or sources.

Limitations:

Beta Stage: Being in beta, NotebookLM can have occasional bugs and limitations.
Limited Source Integration: Currently, it only supports uploading Google Docs and PDFs. You can't directly connect to other note-taking apps.
Source Limit: The maximum of 20 sources might require you to consolidate documents or manually copy and paste text if you have a large number of sources.
Challenges with Specific Tasks: Like many AI tools, NotebookLM can struggle with complex math, poorly formatted PDFs, and retrieving highly specific details.

Data Privacy:

Google Terms of Service Apply: Your use of NotebookLM is governed by Google's Terms of Service.
No Personal Data Used for Training: Google emphasizes that your personal data is never used to train NotebookLM.
Feedback Review for Consumer Accounts: If you choose to provide feedback while logged in with a consumer Google account, human reviewers may access your queries, uploads, and the AI's responses.
Privacy for Workspace Users: For Google Workspace or Google Workspace for Education users, your interactions with NotebookLM are not reviewed by humans and are not used for AI model training.

Conclusion:

NotebookLM presents a powerful new way to research, learn, and create using your own knowledge base. By combining the power of large language models with a focus on source grounded AI, it offers a level of control and accuracy not found in traditional search engines or AI chatbots. While still in its early stages, NotebookLM has the potential to revolutionize how we work with information.

Contrasting NotebookLM with ChatGPT and Perplexity

Here's a comparison of NotebookLM with ChatGPT like LLMs and Perplexity like answer engines, highlighting their key differences:

NotebookLM

Focus: Acts as a personal AI research assistant, enabling users to interact with their uploaded documents.
AI Model: Powered by Google's Gemini 1.5 Pro.
Data Source: Utilizes a user-defined knowledge base, limited to the information within uploaded documents (up to 50 sources with a maximum of 4 million words).
Key Features:
- Supports diverse file formats like Google Docs, PDFs, images, YouTube videos, and audio files.
- Generates podcast-style audio summaries of source materials.
- Provides precise citations, linking responses back to the specific text in the uploaded documents.
Strengths:
- Source-grounded responses: Eliminates hallucinations by restricting the AI to user-provided information.
- Large context window: Facilitates comprehensive analysis by accommodating a vast knowledge base.
- Privacy-focused: Personal data is not used for training, and options exist for limiting human review of interactions.
Limitations:
- Restricted to Google Docs and PDF uploads, lacking direct integration with other note-taking apps.
- Limited to 20 sources, requiring manual consolidation for larger datasets.
- May face challenges with specific tasks like complex math or poorly formatted PDFs.

ChatGPT

Focus: Functions as a general-purpose conversational AI chatbot, capable of engaging in open-ended conversations and creative tasks.
AI Model: Powered by OpenAI's large language models (e.g., GPT-3.5, GPT-4).
Data Source: Trained on a massive dataset of text and code scraped from the internet, enabling it to generate human-like text on a wide range of topics.
Key Features:
- Excels in creative writing, text summarization, translation, code generation, and answering general knowledge questions.
Strengths:
- Vast knowledge base: Access to a broad range of information allows for responses on diverse subjects.
- Creative capabilities: Generative abilities make it suitable for tasks requiring imagination and language fluency.
Limitations:
- Prone to hallucinations: Can generate inaccurate or fabricated information due to its reliance on internet data.
- Limited control over sources: Users cannot restrict the AI to specific sources, making fact-checking more challenging.

Perplexity

Focus: Designed as an AI-powered search engine that provides answers to user queries alongside citations.
AI Model: Leverages large language models for understanding and responding to queries.
Data Source: Retrieves information from the internet, relying on its web search capabilities to answer questions.
Key Features:
- Presents answers in a concise and informative manner, often including bullet points and key takeaways.
- Cites sources to provide transparency and allow for verification.
Strengths:
- Real-time information: Accesses up-to-date information from the web.
- Focus on source attribution: Emphasizes transparency by providing links to the sources used in its responses.
Limitations:
- Potential for bias: Results can be influenced by the ranking algorithms of search engines.
- Less control over knowledge scope: Users have less control over the specific sources the AI utilizes compared to NotebookLM.

Key Differences in a Nutshell:

Feature	NotebookLM	ChatGPT	Perplexity
Data Source	User-uploaded documents	Massive internet dataset	Web search results
Hallucinations	Eliminated	Prone	Possible
Source Control	High (user-defined)	Low	Moderate (can specify links)
Context Window	Large (up to 4 million words)	Variable, depending on the model	Limited by search engine capabilities
Audio Summaries	Yes	No	No

In summary, NotebookLM distinguishes itself through its focus on source-grounded AI, enabling users to have focused conversations with their own knowledge base. This offers a level of accuracy and privacy not found in ChatGPT or Perplexity, making it a valuable tool for researchers, writers, and anyone seeking to leverage AI with their personal or professional data.

Inbound NAT with Palo Alto in OCI

Mukund Murali

This topic is with reference in accessing private resources like Load balancer through the untrust Vnic of NVA like Palo Alto. This is a general scenario when we have a firewall like Palo Alto inside OCI for monitoring North-south traffic. Also there are applications inside OCI which needs to be exposed to the internet for internet users to access it and we might need to use a reverse Proxy like Load balancer to expose it.

The problem in this scenario is, For monitoring all the north-south outbound traffic i.e. the egress internet traffic from the instances has to be sent to the Palo Alto so the ingress traffic cannot be directly to the Load balancer via the internet gateway because that would be asymmetric routing as the return traffic will be through Palo Alto. So the ingress traffic has to come through Untrust Vnic of the Palo Alto as that is the exit interface as well.

Architecture

NAT Configuration inside Palo Alto

You can configure a direct D-NAT for this configuration to translate the untrust IP to the load balancer IP but the Palo alto just drops this traffic at the untrust VNIC itself and doesn't capture this traffic. The reason being forward session is not getting created. The solution for this is to create a SNAT with the Load balancer as the source and the untrust NIC as NAT IP with Bi-Directional checkbox checked.

Palo Alto Configuration

Create the NAT policy :

Select Policies > NAT and click Add
On the General tab, enter a descriptive Name for the NAT rule.
On the Original Packet tab, select the zone you created for your DMZ in the Source Zone section (click Add and then select the zone) and the zone you created for the external network from the Destination Zone list.
In the Source Address section, Add the address object you created for your internal web server address.
On the Translated Packet tab, select Static IP from the Translation Type list in the Source Address Translation section and then select the address object you created for your external web server address from the Translated Address list.
In the Bi-directional field, select Yes.
Click OK.
Click Commit.

This type of architecture is not limited to OCI and can be applied to other cloud services as well.

Prompt Engineering 101 - Getting Started

Mukund Murali

Introduction

We already know what ChatGPT or Google's Bard is and how it helps us. In a nutshell, we give an input to these AI models and it gives us an corresponding output. This output can be relatively correct or incorrect to some instances. This depends upon the input we give and this is called the prompt.

The process of creating an efficient prompt is called as prompt engineering.

We will see what are the different types of Prompt Engineering techniques that we can leverage.

Prompt Principles

There are 2 principles which we need to take into consideration while working on it.

Write clear and specific instructions.
- Using Delimiters : Prompting is just like asking a person something, we need to be clear and specific in what we want. In Prompt Engineering we are going to specify this with the help of delimiters (""", ```, < >, ) which are basically used to separate the section inside prompt. Delimiters are also used to prevent any kind of prompt injection.
- Ask for a structured output.
- Check if conditions are met.
- Few short prompting, by giving few examples.
Give the model time to think.
- Specify the set of steps to complete a task.
- Instruct the model to work out its own solution before rushing to conclusion.

Hallucination

At times there will be scenario where the AI model will give incorrect answers but it never knows that its a wrong answer and the model assumes that this is the correct answer for the prompt given. This is often referred to as Hallucinations in AI.

Iterative Process

Give the idea as prompt.

Understand the output that is generated from that prompt.
Analyze if the desired output is generated or not.
Make the necessary changes in the prompt and re-input it as new prompt.

Summarizing Text

Summarizing text prompt is generally used to brief a large set of text in few words to give its complete context. We can summarize a text with respect to specific context as well which later can be used as a email's body. for example: if you get a summary of a overall shipping operation then you can summarize the data with specific key values like the dates mentioned in it.

Inferring learning

Its a task where the model takes the input text and analyze it by extracting label and values using prompt engineering.

Sentiment Analysis

Sentiment analysis is a type of Inferring where you can take a text and understand on what level of emotion that text was written. This is generally used in product review analysis.

Zero-Shot Learning

In short, Zero-shot learning algorithm is getting an output without giving any example of labelled data to the prompt.

Expanding

This is right opposite to what summarize text does, you give a set of instructions or steps to the AI model and the model will generate a large set of text for it. Generally used in generating email messages.

Chatbot

The most exciting use case of the LLM model is to use it to build a customer chatbot and make it act as a customer service agent to answer the questions of user input.

This is achieved with the help of a helper function which says the AI model to have a conversational interaction with the user to store the user inputs in its memory.

Prompt engineering when used properly will help you extracting the best potential from LLM model and should be used in a responsible way which help in solving real time problems.

Mukund Murali's blog

Exploring Your Cloud with Steampipe and Powerpipe: An Overview with OCI Focus

Introduction

What is Steampipe?

What is Powerpipe?

What is Turbot?

Cloud Providers Supported by Steampipe

Focusing on OCI: Steampipe, Powerpipe, and Terraform

Examples and Use Cases for OCI

OCI Insights Dashboard Questions

Conclusion and Next Steps

Google's NotebookLM: A Second Brain for the AI Age

Contrasting NotebookLM with ChatGPT and Perplexity

NotebookLM

ChatGPT

Perplexity

Inbound NAT with Palo Alto in OCI

Architecture

NAT Configuration inside Palo Alto

Palo Alto Configuration

Create the NAT policy :

Prompt Engineering 101 - Getting Started

Introduction

Prompt Principles

Hallucination

Iterative Process

Summarizing Text

Inferring learning

Sentiment Analysis

Zero-Shot Learning

Expanding

Chatbot